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PREFACE 


This third edition, like the first two, is an introduction to the basic methods, theory, 
and applications of differential equations. A knowledge of elementary calculus is 
presupposed. 

The detailed style of presentation that characterized the previous editions of the text 
has been retained. Many sections have been taken verbatim from the second edition, 
while others have been rewritten or rearranged with the sole intention of making them 
clearer and smoother. As in the earlier editions, the text contains many thoroughly 
worked out examples. Also, a number of new exercises have been added, and assorted 
exercise sets rearranged to make them more useful in teaching and learning. 

The book is divided into two main parts. The first part (Chapters 1 through 9) deals 
with the material usually found in a one-semester introductory course in ordinary 
differential equations. This part is also available separately as Introduction to Ordinary 
Differential Equations , Third Edition (John Wiley & Sons, New York, 1980). The 
second part of the present text (Chapters 10 through 14) introduces the reader to 
certain specialized and more advanced methods and provides an introduction to 
fundamental theory. The table of contents indicates just what topics are treated. 

The following additions and modifications are specifically noted. 

1. Material emphasizing the second-order linear equation has been inserted at 
appropriate places in Section 4.1. 

2. New illustrative examples, including an especially detailed introductory one, have 
been written to clarify the Method of Undetermined Coefficients in Section 4.3, 
and a useful table has also been supplied. 

3. Matrix multiplication and inversion have been added to the introductory 
material on linear algebra in Section 7.5. 

4. Additional applications now appear in the text in Sections 3.3 and 7.2. 

5. Section 7.6 is a completely new section on the application of matrix algebra to the 
solution of linear systems with constant coefficients in the special case of two 
equations in two unknown functions. The theory that occupied this section in the 
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second edition now appears in Chapter 1 1 (see note 9 following). We believe that 
this change represents a major improvement for both introductory and inter- 
mediate courses. 

6. Section 7.7 extends the matrix method of Section 7.6 to the case of linear systems 
with constant coefficients involving n equations in n unknown functions. Several 
detailed examples illustrate the method for the case n = 3. 

7. Both revised and new material on the Laplace Transform of step functions, 
translated functions, and periodic functions now appears in Section 9.1. 

8. The basic existence theory for systems and higher-order equations, formerly 
located at the beginning of Chapter 11, has now been placed at the end of 
Chapter 10. This minor change has resulted in better overall organization. 

9. Chapter 11, the Theory of Linear Differential Equations^ has been changed 
considerably. Sections 11.1 through 1 1 .4 present the fundamental theory of linear 
systems. Much of this material was found in Section 7.6 in the second edition, and 
some additional results are also included here. Sections 11.5 through 11.7 now 
present the basic theory of the single nth-order equation, makingconsiderable use 
of the material of the preceding sections. Section 11.8 introduces second-order 
self-adjoint equations and proceeds through the fundamentals of classical Sturm 
Theory. We believe that the linear theory is now presented more coherently than 
in the previous edition. 

10. An appendix presents, without proof, the fundamentals of second and third order 
determinants. 

The book can be used as a text in several different types of courses. The more or less 
traditional onc-semester introductory course could be based on Chapter 1 through 
Section 7.4 of Chapter 7 if elementary applications are to be included. An alternative 
one-semester version omitting applications but including numerical methods and 
Laplace transforms could be based on Chapters 1, 2, 4, 6, 7, 8, and 9. An introductory 
course designed to lead quickly to the methods of partial differential equations could 
be based on Chapters 1, 2 (in part), 4, 6, 12, and 14. 

The book can also be used as a text in various intermediate courses for juniors and 
seniors who have already had a one-semester introduction to the subject. An interme- 
diate course emphasizing further methods could be based on Chapters 8, 9, 12, 13, 
and 14. An intermediate course designed as an introduction to fundamental theory 
could be based on Chapters 10 through 14. We also note that Chapters 13 and 14 can 
be interchanged advantageously. 

I am grateful to several anonymous reviewers who made useful comments and 
suggestions. I thank my colleagues William Bonnice and Robert O. Kimball for helpful 
advice. I also thank my son, Shepley L. Ross, II, graduate student in mathematics, 
University of Rochester, Rochester, New York, for his careful reviewing and helpful 
suggestions. 

I am grateful to Solange Abbott for her excellent typing. I am pleased to record my 
appreciation to Editor Gary Ostedt and the Wiley staff for their constant helpfulness 
and cooperation. 

As on several previous occasions, the most thanks goes to my wife who offered 
encouragement, understanding, patience, and help in many different ways. Thanks, 
Gin. 


Shepley L. Ross 
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CHAPTER ONE 

Differential Equations and Their Solutions 


The subject of differential equations constitutes a large and very important branch of 
modern mathematics. From the early days of the calculus the subject has been an area 
of great theoretical research and practical applications, and it continues to be so in our 
day. This much stated, several questions naturally arise. Just what is a differential 
equation and what does it signify? Where and how do differential equations originate 
and of what use are they? Confronted with a differential equation, what does one do 
with it, how does one do it, and what are the results of such activity? These questions 
indicate three major aspects of the subject: theory, method, and application. The 
purpose of this chapter is to introduce the reader to the basic aspects of the subject and 
at the same time give a brief survey of the three aspects just mentioned. In the course of 
the chapter, we shall find answers to the general questions raised above, answers that 
will become more and more meaningful as we proceed with the study of differential 
equations in the following chapters. 


1.1 CLASSIFICATION OF DIFFERENTIAL EQUATIONS; THEIR ORIGIN AND 
APPLICATION 

A. Differential Equations and Their Classification 
DEFINITION 

An equation involving derivatives of one or more dependent variables with respect to one or 
more independent variables is called a differential equation.* 


* In connection with this basic definition, we do not include in the class of differential equations those 
equations that are actually derivative identities. For example, we exclude such expressions as 


— (e ax ) = ae ax , 
cfx 


d dv du 

— (uv) = u — + v — > 
dx dx cfx 


and so forth. 


3 
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DIFFERENTIAL EQUATIONS AND THEIR SOLUTIONS 


► Example 1.1 

For examples of differential equations we list the following: 


d 2 y f dyV 

d^ +xy {di) =0 - 

(i.i) 

d 4 x _ d 2 x 

(1.2) 

, a T- 5 j- + 3x = sin t, 
dr dt 2 

dv 8v 

(1.3) 

~ds + ~dt =V ’ 

d 2 u d 2 u d 2 u 

(1.4) 

5x 2 + dy 2 dz 2 


From the brief list of differential equations in Example 1.1 it is clear that the various 
variables and derivatives involved in a differential equation can occur in a variety of 
ways. Clearly some kind of classification must be made. To begin with, we classify 
differential equations according to whether there is one or more than one independent 
variable involved. 


DEFINITION 

A differential equation involving ordinary derivatives of one or more dependent variables 
with respect to a single independent variable is called an ordinary differential equation. 


► Example 1 .2 

Equations (1.1) and (1.2) are ordinary differential equations. In Equation (1.1) the 
variable x is the single independent variable, and y is a dependent variable. In Equation 
(1.2) the independent variable is t , whereas x is dependent. 


DEFINITION 

A differential equation involving partial derivatives of one or more dependent variables 
with respect to more than one independent variable is called a partial differential 
equation. 

► Example 1 .3 

Equations (1.3) and (1.4) are partial differential equations. In Equation (1.3) the 
variables s and t are independent variables and v is a dependent variable. In Equation 
(1.4) there are three independent variables: x, y, and z; in this equation u is dependent. 

We further classify differential equations, both ordinary and partial, according to the 
order of the highest derivative appearing in the equation. For this purpose we give the 
following definition. 
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DEFINITION 

The order of the highest ordered derivative involved in a differential equation is called the 
order of the differential equation. 


► Example 1 .4 

The ordinary differential equation (1.1) is of the second order, since the highest 
derivative involved is a second derivative. Equation (1.2) is an ordinary differential 
equation of the fourth order. The partial differential equations (1.3) and (1.4) are of the 
first and second orders, respectively. 

Proceeding with our study of ordinary differential equations, we now introduce the 
important concept of linearity applied to such equations. This concept will enable us to 
classify these equations still further. 


DEFINITION 


A linear ordinary differential equation of order n, in the dependent variable y and the 
independent variable x, is an equation that is in, or can be expressed in, the form 

d n y d n ~ 1 y dy 

+ «iM^r + ••• + + a n (x)y = b(x), 

where a 0 is not identically zero. 


Observe (1) that the dependent variable y and its various derivatives occur to the first 
degree only, (2) that no products of y and/or any of its derivatives are present, and (3) 
that no transcendental functions of y and/or its derivatives occur. 


► Example 1 .5 


The following ordinary differential equations are both linear. In each case y is the 
dependent variable. Observe that y and its various derivatives occur to the first degree 
only and that no products of y and/or any of its derivatives are present. 




(1.5) 


^l + x 2^l + x 3^l 

dx 4 dx 3 dx 


= xe 


(1.6) 


DEFINITION 

A nonlinear ordinary differential equation is an ordinary differential equation that is not 
linear. 
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► Example 1 .6 

The following ordinary differential equations are all nonlinear: 



(1.7) 


(1.8) 


(1.9) 


Equation (1.7) is nonlinear because the dependent variable y appears to the second 
degree in the term 6 y 2 . Equation (1.8) owes its nonlinearity to the presence of the term 
5 (dy/dx) 3 , which involves the third power of the first derivative. Finally, Equation (1.9) 
is nonlinear because of the term 5 y(dy/dx), which involves the product of the 
dependent variable and its first derivative. 

Linear ordinary differential equations are further classified according to the nature 
of the coefficients of the dependent variables and their derivatives. For example, 
Equation (1.5) is said to be linear with constant coefficients , while Equation (1.6) is linear 
with variable coefficients . 


B. Origin and Application of Differential Equations 

Having classified differential equations in various ways, let us now consider briefly 
where, and how, such equations actually originate. In this way we shall obtain some 
indication of the great variety of subjects to which the theory and methods of 
differential equations may be applied. 

Differential equations occur in connection with numerous problems that are 
encountered in the various branches of science and engineering. We indicate a few such 
problems in the following list, which could easily be extended to fill many pages. 

1. The problem of determining the motion of a projectile, rocket, satellite, or planet. 

2. The problem of determining the charge or current in an electric circuit. 

3. The problem of the conduction of heat in a rod or in a slab. 

4. The problem of determining the vibrations of a wire or a membrane. 

5. The study of the rate of decomposition of a radioactive substance or the rate of 
growth of a population. 

6. The study of the reactions of chemicals. 

7. The problem of the determination of curves that have certain geometrical 
properties. 

The mathematical formulation of such problems give rise to differential equations. 
But just how does this occur? In the situations under consideration in each of the above 
problems the objects involved obey certain scientific laws. These laws involve various 
rates of change of one or more quantities with respect to other quantities. Let us re- 
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call that such rates of change are expressed mathematically by derivatives. In the 
mathematical formulation of each of the above situations, the various rates of change 
are thus expressed by various derivatives and the scientific laws themselves become 
mathematical equations involving derivatives, that is, differential equations. 

In this process of mathematical formulation, certain simplifying assumptions 
generally have to be made in order that the resulting differential equations be tractable. 
For example, if the actual situation in a certain aspect of the problem is of a relatively 
complicated nature, we are often forced to modify this by assuming instead an 
approximate situation that is of a comparatively simple nature. Indeed, certain 
relatively unimportant aspects of the problem must often be entirely eliminated. The 
result of such changes from the actual nature of things means that the resulting 
differential equation is actually that of an idealized situation. Nonetheless, the 
information obtained from such an equation is of the greatest value to the scientist. 

A natural question now is the following: How does one obtain useful information 
from a differential equation? The answer is essentially that if it is possible to do so, one 
solves the differential equation to obtain a solution; if this is not possible, one uses the 
theory of differential equations to obtain information about the solution. To 
understand the meaning of this answer, we must discuss what is meant by a solution of 
a differential equation; this is done in the next section. 


Exercises 


Classify each of the following differential equations as ordinary or partial differential 
equations; state the order of each equation; and determine whether the equation under 
consideration is linear or nonlinear. 


1. 

3. 

5. 

7. 

9. 


dy , 

— + x z y = xe x . 
ax 

d 2 u d 2 u _ 
dx 2 + dy 2 

d 4 y +3 fd 2 y' 5 


dx 4 

d 2 y 
dx 

d 6 x 

dt‘ + \dt * 


dx‘< +5y = 0 ■ 


2 + y sin x = 0. 


d 4 x\f d 3 x 


dt 2 


+ x = t. 


2 . 


d 3 y d 2 y 
+ 4- 


dx 


d^—dx^- 5 - + 2y - SinX - 


4. x 2 dy + y 2 dx = 0. 

6 . 


d 4 u d 2 u d 2 u 

dx 2 dy 2 dx 2 dy 2 U 


8 . 

10 . 


d 2 y 

dx 


2 + x sin y = 0. 


dr\ 3 jd 2 r 
ds) yj ds 2 + 


1.2 SOLUTIONS 

A. Nature of Solutions 

We now consider the concept of a solution of the nth-order ordinary differential 
equation. 
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DEFINITION 

Consider the nth-order ordinary differential equation 

f\ 


dy d n yl 

ti? J =0 ’ 


(1-10) 


where F is a real function of its (n + 2) arguments x, y. 


dy d n y 
dx ’ * * * ’ dx n * 


1. Let f be a real function defined for all x in a real interval I and having an nth 
derivative (and hence also all lower ordered derivatives) for all x e I. The function f is 
called an explicit solution of the differential equation (1.10) on I if it fulfills the following 
two requirements : 

F[x,f(x),f'(x),...,f M (x)-] (A) 

is defined for all xe/, and 

FlxJ(x\f'(x),...,fW(xn= 0 (B) 

for all x e I. That is, the substitution of f(x) and its various derivations for y and its 
corresponding derivatives , respectively , in (1.10) reduces (1.10) to an identity on 1. 

2. A relation g(x , y) = 0 is called an implicit solution of (1.10) if this relation defines 
at least one real function f of the variable x on an interval I such that this function is an 
explicit solution of (1.10) on this interval 

3. Both explicit solutions and implicit solutions will usually be called simply solutions. 

Roughly speaking, then, we may say that a solution of the differential equation (1.10) 
is a relation — explicit or implicit — between x and y, not containing derivatives, which 
identically satisfies (1.10). 


► Example 1 .7 


The function / defined for all real x by 


f(x) = 2 sin x + 3 cos x 

(l.n) 

is an explicit solution of the differential equation 


d 2 y 

^ + * ,=o 

(1-12) 


for all real x. First note that / is defined and has a second derivative for all real x. Next 
observe that 

f'(x) = 2 cos x — 3 sin x, 

/"(x) = — 2 sin x — 3 cos x. 

Upon substituting /"(x) for d 2 y/dx 2 and / (x) for y in the differential equation ( 1 . 1 2), it 
reduces to the identity 

( — 2 sin x — 3 cos x) + (2 sin x + 3 cos x) = 0, 
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which holds for all real x. Thus the function / defined by (1.1 1) is an explicit solution of 
the differential equation (1.12) for all real x. 


► Example 1 .8 

The relation 

x 2 -by 2 — 25 = 0 (1.13) 

is an implicit solution of the differential equation 

dy 

x+ y-r = 0 (1.14) 

dx 

on the interval / defined by — 5 < x < 5. For the relation (1.13) defines the two real 
functions f x and f 2 given by 

f l (x) = j25-x 1 

and 

f 2 (x) = —y/25 - X 2 , 

respectively, for all real x e /, and both of these functions are explicit solutions of the 
differential equations (1.14) on /. 

Let us illustrate this for the function f x . Since 

fi(x) = x/25 - x 2 , 

we see that 


/iW 


— x 


x/ 25 - x 2 


for all real x e /. Substituting f x (x) for y and f\(x) for dy/dx in (1.14), we obtain the 
identity 


x 


+ ( N /25 - x 2 ) 



= 0 


or x — x = 0, 


which holds for all real x e /. Thus the function f x is an explicit solution of (1.14) on the 
interval /. 

Now consider the relation 


x 2 + y 2 + 25 = 0. (1.15) 

Is this also an implicit solution of Equation (1.14)? Let us differentiate the relation 
(1.15) implicitly with respect to x. We obtain 


2x + 2y y- = 0 
dx 


or 


dy_ 

dx 


Substituting this into the differential equation (1.14), we obtain the formal identity 



= 0 . 


x + y 
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Thus the relation (1.15) formally satisfies the differential equation (1.14). Can we 
conclude from this alone that ( 1 . 1 5) is an implicit solution of ( 1 . 1 4)? The answer to this 
question is '‘no,” for we have no assurance from this that the relation (1.15) defines any 
function that is an explicit solution of (1.14) on any real interval /. All that we have 
shown is that ( 1 . 1 5) is a relation between x and y that, upon implicit differentiation and 
substitution, formally reduces the differential equation (1.14) to a formal identity. It is 
called a formal solution; it has the appearance of a solution; but that is all that we know 
about it at this stage of our investigation. 

Let us investigate a little further. Solving (1.15) for y, we find that 

y = ±x/-25 - X 2 . 

Since this expression yields nonreal values of y for all real values of x, we conclude 
that the relation (1.15) does not define any real function on any interval. Thus the 
relation (1.15) is not truly an implicit solution but merely a formal solution of the 
differential equation (1.14). 

In applying the methods of the following chapters we shall often obtain relations that 
we can readily verify are at least formal solutions. Our main objective will be to gain 
familiarity with the methods themselves and we shall often be content to refer to the 
relations so obtained as “solutions,” although we have no assurance that these relations 
are actually true implicit solutions. If a critical examination of the situation is required, 
one must undertake to determine whether or not these formal solutions so obtained are 
actually true implicit solutions which define explicit solutions. 

In order to gain further insight into the significance of differential equations and 
their solutions, we now examine the simple equation of the following example. 

► Example 1 .9 

Consider the first-order differential equation 

dy 

-f- = 2x. (1.16) 

dx 

The function f 0 defined for all real x by / 0 (x) = x 2 is a solution of this equation. So also 
are the functions f l9 / 2 , and / 3 defined for all real x by ffx) = x 2 + 1, / 2 (x) = x 2 + 2, 
and / 3 (x) = x 2 + 3, respectively. In fact, for each real number c, the function f c defined 
for all real x by 

f c (x) = x 2 + c (1.17) 

is a solution of the differential equation ( 1 . 1 6). In other words, the formula (1 . 1 7) defines 
an infinite family of functions, one for each real constant c, and every function of this 
family is a solution of (1.16). We call the constant c in (1.17) an arbitrary constant or 
parameter and refer to the family of functions defined by (1.17) as a one-parameter 
faimily of solutions of the differential equation (1.16). We write this one-parameter 
family of solutions as 

y = x 2 + c. (1.18) 

Although it is clear that every function of the family defined by (1.18) is a solution of 
( 1 . 1 6), we have not shown that the family of functions defined by ( 1 . 1 8) includes all of 
the solutions of (1.16). However, we point out (without proof) that this is indeed the 
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case here; that is, every solution of (1.16) is actually of the form (1.18) for some 
appropriate real number c. 

Note. We must not conclude from the last sentence of Example 1 .9 that every first- 
order ordinary differential equation has a so-called one-parameter family of solutions 
which contains all solutions of the differential equation, for this is by no means the case. 
Indeed, some first-order differential equations have no solution at all (see Exercise 7(a) 
at the end of this section), while others have a one-parameter family of solutions plus 
one or more “extra” solutions which appear to be “different” from all those of the 
family (see Exercise 7(b) at the end of this section). 

The differential equation of Example 1 .9 enables us to obtain a better understanding 
of the analytic significance of differential equations. Briefly stated, the differential 
equation of that example defines functions , namely, its solutions. We shall see that this 
is the case with many other differential equations of both first and higher order. Thus 
we may say that a differential equation is merely an expression involving derivatives 
which may serve as a means of defining a certain set of functions: its solutions. Indeed, 
many of the now familiar functions originally appeared in the form of differential 
equations that define them. 

We now consider the geometric significance of differential equations and their 
solutions. We first recall that a real function F may be represented geometrically by a 
curve y - F(x) in the xy plane and that the value of the derivative of F at x, F'(x), may 
be interpreted as the slope of the curve y = F(x) at x. Thus the general first-order 
differential equation 

t=f^- " j9 > 

where / is a real function, may be interpreted geometrically as defining a slope / (x, y) at 
every point (x, y) at which the function / is defined. Now assume that the differential 
equation (1.19) has a so-called one-parameter family of solutions that can be written in 
the form 

y = F(x,c ), (1.20) 

where c is the arbitrary constant or parameter of the family. The one-parameter family 
of functions defined by (1.20) is represented geometrically by a so-called one-parameter 
family of curves in the xy plane, the slopes of which are given by the differential 
equation (1.19). These curves, the graphs of the solutions of the differential equation 
(1.19), are called the integral curves of the differential equation (1.19). 


► Example 1.10 


Consider again the first-order differential equation 



(1.16) 


of Example 1.9. This differential equation may be interpreted as defining the slope 2x at 
the point with coordinates (x, y) for every real x. Now, we observed in Example 1.9 that 
the differential equation (1.16) has a one-parameter family of solutions of the form 

y = x 2 + c, (1.18) 
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where c is the arbitrary constant or parameter of the family. The one-parameter family 
of functions defined by ( 1 . 1 8) is represented geometrically by a one-parameter family of 
curves in the xy plane, namely, the family of parabolas with Equation (1.18). The slope 
of each of these parabolas is given by the differential equation ( 1 . 1 6) of the family. Thus 
we see that the family of parabolas (1.18) defined by differential equation (1.16) is that 
family of parabolas, each of which has slope 2x at the point (x, y) for every real x, and 
all of which have the y axis as axis. These parabolas are the integral curves of the 
differential equation (1.16). See Figure 1.1. 


B. Methods of Solution 

When we say that we shall solve a differential equation we mean that we shall find one 
or more of its solutions. How is this done and what does it really mean? The greater 
part of this text is concerned with various methods of solving differential equations. 
The method to be employed depends upon the type of differential equation under 
consideration, and we shall not enter into the details of specific methods here. 

But suppose we solve a differential equation, using one or another of the various 
methods. Does this necessarily mean that we have found an explicit solution / 
expressed in the so-called closed form of a finite sum of known elementary functions? 
That is, roughly speaking, when we have solved a differential equation, does this 
necessarily mean that we have found a “formula” for the solution? The answer is “no.” 
Comparatively few differential equations have solutions so expressible; in fact, a 
closed-form solution is really a luxury in differential equations. In Chapters 2 and 4 we 
shall consider certain types of differential equations that do have such closed-form 
solutions and study the exact methods available for finding these desirable solutions. 
But, as we have just noted, such equations are actually in the minority and we must 
consider what it means to “solve” equations for which exact methods are unavailable. 
Such equations are solved approximately by various methods, some of which are 
considered in Chapters 6 and 8. Among such methods are series methods, numerical 
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methods, and graphical methods. What do such approximate methods actually yield? 
The answer to this depends upon the method under consideration. 

Series methods yield solutions in the form of infinite series; numerical methods give 
approximate values of the solution functions corresponding to selected values of the 
independent variables; and graphical methods produce approximately the graphs of 
solutions (the integral curves). These methods are not so desirable as exact methods 
because of the amount of work involved in them and because the results obtained from 
them are only approximate; but if exact methods are not applicable, one has no choice 
but to turn to approximate methods. Modern science and engineering problems 
continue to give rise to differential equations to which exact methods do not apply, and 
approximate methods are becoming increasingly more important. 


Exercises 

1. Show that each of the functions defined in Column I is a solution of the 
corresponding differential equation in Column II on every interval a < x < b of 
the x axis. 

i ii 

(a) f(x) = x + 3e~ x ^ + y = x + 1 

(b) f(x) = 2e 3x - 5e Ax ^ - 7 ^ + 12); = 0 

(c) f(x) = e x + lx 1 + 6x + 7 3 -j — 3^ + 2 y = 4x 2 

ax ax 

(d) /W = JTP {l+x2) ^ + 4x ^ + 2y = 0 

2. (a) Show that x 3 -f 3xy 2 = 1 is an implicit solution of the differential equation 

2 xy(dy/dx) + x 2 + y 2 = 0 on the interval 0 < x < 1 . 

(b) Show that 5x 2 j / 2 — 2x 3 y 2 =1 is an implicit solution of the differential 
equation x(dy/dx) + y = x 3 y 3 on the interval 0 < x < f . 

3. (a) Show that every function / defined by 

f(x) = (x 2 + C )e~ 3", 

where c is an arbitrary constant, is a solution of the differential equation 

^ + 3y = 3x 2 e~ 3x . 
dx 

(b) Show that every function / defined by 

f(x) = 2 + ce~ 2x \ 

where c is an arbitrary constant, is a solution of the differential equation 
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4. (a) Show that every function / defined by /(x) = + c 2 e 2x ,wherec! andc 2 

are arbitrary constants, is a solution of the differential equation 


d 2 y _ydy_ 

dx 2 dx 


- 8y = 0. 


(b) Show that every function defined by g{x) = c x e 2x -F c 2 xe lx + c 2 e 2x , where 
c 1 , c 2 , and c 3 are arbitrary constants, is a solution of the differential equation 


d 3 y 

dP 


2 d -h_A^i 

dx 2 dx 


+ 8y = 0. 


5. (a) For certain values of the constant m the function / defined by f(x) = e mx is a 
solution of the differential equation 


d 3 y 
dx 3 


dx 2 


4^+ I2y = 0. 
dx 


Determine all such values of m. 


(b) For certain values of the constant n the function g defined by g(x) = x n is a 
solution of the differential equation 


d 2 ) 


x 3 — + 2x 2 '— 
dx 3 dx 2 


10x^-8y = 0. 
dx 


Determine all such values of n. 

6. (a) Show that the function / defined by f(x) = (lx 2 + 2e 3x + 3)e~ 2x satisfies the 

differential equation 


-r + ly = 6e x + 4xe~ 2x 
dx 


and also the condition /( 0) = 5. 

(b) Show that the function f defined by f(x) = 3e 2x — 2xe Zx — cos 2x satisfies 
the differential equation 


dx 2 dx 


+ 4 y = 


— 8 sin 2x 


and also the conditions that /( 0) = 2 and /'( 0) = 4. 


7. (a) Show that the first-order differential equation 


dy 

dx 


+ | y| + 1 = 0 


has no (real) solutions. 

(b) Show that the first-order differential equation 



— 4y = 0 


has a one-parameter family of solutions of theform f(x) = (x + c) 2 , wherecis 
an arbitrary constant, plus the “extra” solution #(x) = 0 that is not a member 
of this family f(x) = (x + c) 2 for any choice of the constant c. 
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1.3 INITIAL- VALUE PROBLEMS, BOUNDARY- VALUE PROBLEMS, AND 
EXISTENCE OF SOLUTIONS 

A. Initial-Value Problems and Boundary-Value Problems 

We shall begin this section by considering the rather simple problem of the following 
example. 


► Example 1.11 

Problem. Find a solution / of the differential equation 


such that at x = 1 this solution / has the value 4. 


( 1 . 21 ) 


Explanation. First let us be certain that we thoroughly understand this problem. 
We seek a real function / which fulfills the two following requirements: 

1. The function / must satisfy the differential equation (1.21). That is, the function / 
must be such that f'(x) = 2x for all real x in a real interval /. 

2. The function / must have the value 4 at x = 1. That is, the function / must be such 
that /(l) = 4. 

Notation. The stated problem may be expressed in the following somewhat 
abbreviated notation: 


3/(1) = 4. 

In this notation we may regard y as representing the desired solution. Then the 
differential equation itself obviously represents requirement 1, and the statement 
y(l) = 4 stands for requirement 2. More specifically, the notation y(l) = 4 states 
that the desired solution y must have the value 4 at x = 1; that is, y = 4 at x = 1. 

Solution. We observed in Example 1.9 that the differential equation (1.21) has a 
one-parameter family of solutions which we write as 

y = x 2 + c, (1.22) 

where c is an arbitrary constant, and that each of these solutions satisfies requirement 1 . 
Let us now attempt to determine the constant c so that (1.22) satisfies requirement 2, 
that is, y = 4 at x = 1. Substituting x = 1, y = 4 into (1.22), we obtain 4 = 1 + c, and 
hence c = 3. Now substituting the value c = 3 thus determined back into (1.22), we 
obtain 


y = x 2 + 3, 
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which is indeed a solution of the differential equation (1.21), which has the value 4 at 
x = 1. In other words, the function / defined by 

f(x) = x 2 + 3, 

satisfies both of the requirements set forth in the problem. 

Comment on Requirement 2 and Its Notation. In a problem of this type, require- 
ment 2 is regarded as a supplementary condition that the solution of the differential 
equation must also satisfy. The abbreviated notation y(l) = 4, which we used to 
express this condition, is in some way undesirable, but it has the advantages of being 
both customary and convenient. 

In the application of both first- and higher-order differential equations the problems 
most frequently encountered are similar to the above introductory problem in that they 
involve both a differential equation and one or more supplementary conditions which 
the solution of the given differential equation must satisfy. If all of the associated 
supplementary conditions relate to one x value, the problem is called an initial-value 
problem (or one-point boundary-value problem). If the conditions relate to two dif- 
ferent x values, the problem is called a two-point boundary-value problem (or simply a 
boundary-value problem). We shall illustrate these concepts with examples and then 
consider one such type of problem in detail. Concerning notation, we generally employ 
abbreviated notations for the supplementary conditions that are similar to the ab- 
breviated notation introduced in Example 1.11. 

► Example 1.12 


d^y 

dx 




3*1) = 3, 

/d)= -4. 

This problem consists in finding a solution of the differential equation 


which assumes the value 3 at x = 1 and whose first derivative assumes the value —4 at 
x = 1. Both of these conditions relate to one x value, namely, x = 1. Thus this is an 
initial-value problem. We shall see later that this problem has a unique solution. 

► Example 1.13 


^3 
dx 2 


+ y = 0 , 


3*0) = 1, 
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In this problem we again seek a solution of the same differential equation, but this time 
the solution must assume the value 1 at x = 0 and the value 5 at x = tc/2. That is, the 
conditions relate to the two different x values, 0 and tc/2. This is a (two-point) 
boundary-value problem. This problem also has a unique solution; but the boundary- 
value problem 


y( 0) = l, y(n) = 5, 

has no solution at all! This simple fact may lead one to the correct conclusion that 
boundary-value problems are not to be taken lightly! 


We now turn to a more detailed consideration of the initial-value problem for a first- 
order differential equation. 


DEFINITION 


Consider the first-order differential equation 

g- /<*,,), (1-23) 

where f is a continuous function of x and y in some domain * D of the xy plane; and let 
(x 0 ,y 0 ) be a point of D. The initial-value problem associated with {1.23) is to find 
a solution 0 of the differential equation ( 1.23 ), defined on some real interval containing x 0 , 
and satisfying the initial condition 

<t>(x 0 ) = yo- 

In the customary abbreviated notation , this initial-value problem may be written 


y(* 0 ) = y 0 - 

To solve this problem, we must find a function 0 that not only satisfies the differential 
equation (1.23) but that also satisfies the initial condition that it has the value y 0 when x 
has value x 0 . The geometric interpretation of the initial condition, and hence of the 
entire initial-value problem, is easily understood. The graph of the desired solution 0 
must pass through the point with coordinates (x 0 , y 0 ). That is, interpreted geometri- 
cally, the initial- value problem is to find an integral curve of the differential equation 
(1.23) that passes through the point (x 0 , y 0 )- 
The method of actually finding the desired solution 0 depends upon the nature of the 
differential equation of the problem, that is, upon the form of / (x, y). Certain special 
types of differential equations have a one-parameter family of solutions whose 
equation may be found exactly by following definite procedures (see Chapter 2). If the 
differential equation of the problem is of some such special type, one first obtains the 
equation of its one-parameter family of solutions and then applies the initial condition 


* A domain is an open, connected set. For those unfamiliar with such concepts, D may be regarded as the 
interior of some simple closed curve in the plane. 
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to this equation in an attempt to obtain a “particular” solution 0 that satisfies the entire 
initial-value problem. We shall explain this situation more precisely in the next 
paragraph. Before doing so, however, we point out that in general one cannot find 
the equation of a one-parameter family of solutions of the differential equation; 
approximate methods must then be used (see Chapter 8). 

Now suppose one can determine the equation 

g(x,y,c) = 0 (1.24) 

of a one-parameter family of solutions of the differential equation of the problem. 
Then, since the initial condition requires that y = y 0 at x = x 0 , we let x = x 0 and 
y = y 0 in (1.24) and thereby obtain 

g(xo,yo,c) = 0 . 

Solving this for c, in general we obtain a particular value of c which we denote here by 
c 0 . We now replace the arbitrary constant c by the particular constant c 0 in ( 1 .24), thus 
obtaining the particular solution 

g(x> y>c 0 ) = o. 

The particular explicit solution satisfying the two conditions (differential equation and 
initial condition) of the problem is then determined from this, if possible. 

We have already solved one initial-value problem in Example 1.11. We now give 
another example in order to illustrate the concepts and procedures more thoroughly. 


► Example 1.14 


Solve the initial-value problem 


dy x 

dx y ’ 


(1.25) 


y( 3) = 4, 


(1.26) 


given that the differential equation (1 .25) has a one-parameter family of solutions which 
may be written in the form 

X 2 + j; 2 =c 2 < (1.27) 

The condition (1.26) means that we seek the solution of (1.25) such that y = 4 at x = 3. 
Thus the pair of values (3,4) must satisfy the relation (1.27). Substituting x = 3 and 
y = 4 into (1.27), we find 


9 + 16 = c 2 or c 2 = 25. 


Now substituting this value of c 2 into (1.27), we have 


x 2 -f- y 2 = 25. 

Solving this for y, we obtain 

y = ±s/25 — x 2 . 

Obviously the positive sign must be chosen to give y the value +4 at x = 3. Thus the 
function / defined by 


f(x) = V25-X 2 , 


-5 < x < 5, 
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is the sol ution of th e problem. In the usual abbreviated notation, we write this solution 
as y = ^/25 — x 2 . 


B. Existence of Solutions 

In Example 1.14 we were able to find a solution of the initial-value problem under 
consideration. But do all initial-value and boundary-value problems have solutions? 
We have already answered this question in the negative, for we have pointed out that 
the boundary-value problem 


^3 

dx 1 


+ y = 0, 


y(0) = i, 

y{n) = 5, 

mentioned at the end of Example 1.13, has no solution! Thus arises the question of 
existence of solutions: given an initial-value or boundary-value problem, does it 
actually have a solution? Let us consider the question for the initial-value problem 
defined on page 1 7. Here we can give a definite answer. Every initial-value problem that 
satisfies the definition on page 17 has at least one solution. 

But now another question is suggested, the question of uniqueness . Does such a 
problem ever have more than one solution? Let us consider the initial-value problem 

i i - ,■» 

dx y 9 

y( 0) = o. 

One may verify that the functions f x and f 2 defined, respectively, by 

fi(x) = 0 for all real x; 

and 

/ 2 M = (3*) 3/2 > *>0; fi w = 0, x < 0; 

are both solutions of this initial-value problem! In fact, this problem has infinitely many 
solutions! The answer to the uniqueness question is clear: the initial-value problem, as 
stated, need not have a unique solution. In order tb ensure uniqueness, some additional 
requirement must certainly be imposed. We shall see what this is in Theorem 1.1, which 
we shall now state. 


THEOREM 1.1. BASIC EXISTENCE AND UNIQUENESS THEOREM 


Hypothesis. Consider the differential equation 



20 DIFFERENTIAL EQUATIONS AND THEIR SOLUTIONS 


where 

1. The function f is a continuous function of x and y in some domain D of the xy plane , 
and 

2. The partial derivative df/dy is also a continuous function of x and y in D; and let 
(x 0 , yo) be u point in D. 

Conclusion. There exists a unique solution (j) of the differential equation ( 1.28 ), 
defined on some interval \x — x 0 | < h, where h is sufficiently small , that satisfies the 
condition 


<Kx o) = yo- (1-29) 

Explanatory Remarks. This basic theorem is the first theorem from the theory of 
differential equations which we have encountered. We shall therefore attempt to 
explain its meaning in detail. 


1. It is an existence and uniqueness theorem. This means that it is a theorem which 
tells us that under certain conditions (stated in the hypothesis) something exists (the 
solution described in the conclusion) and is unique (there is only one such solution). It 
gives no hint whatsoever concerning how to find this solution but merely tells us that 
the problem has a solution. 

2. The hypothesis tells us what conditions are required of the quantities involved. It 
deals with two objects: the differential equation (1.28) and the point (x 0 , y 0 )- As f ar as 
the differential equation (1.28) is concerned, the hypothesis requires that both the 
function / and the function df/dy (obtained by differentiating /(x, y) partially with 
respect to y) must be continuous in some domain D of the xy plane. As far as the point 
(x 0 , y 0 ) is concerned, it must be a point in this same domain D, where / and df/dy are so 
well behaved (that is, continuous). 

3. The conclusion tells us of what we can be assured when the stated hypothesis is 
satisfied. It tells us that we are assured that there exists one and only one solution (f of 
the differential equation, which is defined on some interval \x — x 0 \ < h centered about 
x 0 and which assumes the value y 0 when x takes on the value x 0 . That is, it tells us that, 
under the given hypothesis on /(x, y), the initial-value problem 


= fix, y), 


y(*o) = yo > 

has a unique solution that is valid in some interval about the initial point x 0 . 

4. The proof of this theorem is omitted. It is proved under somewhat less restrictive 
hypotheses in Chapter 10. 

5. The value of an existence theorem may be worth a bit of attention. What good is 
it, one might ask, if it does not tell us how to obtain the solution? The answer to this 
question is quite simple: an existence theorem will assure us that there is a solution to 
look for! It would be rather pointless to spend time, energy, and even money in trying to 
find a solution when there was actually no solution to be found! As for the value of the 
uniqueness, it would be equally pointless to waste time and energy finding one 
particular solution only to learn later that there were others and that the one found was 
not the one wanted! 



1.3 INITIAL-VALUE PROBLEMS, BOUNDARY-VALUE PROBLEMS 21 


We have included this rather lengthy discussion in the hope that the student, who has 
probably never before encountered a theorem of this type, will obtain a clearer idea of 
what this important theorem really means. We further hope that this discussion will 
help him to analyze theorems which he will encounter in the future, both in this book 
and elsewhere. We now consider two simple examples which illustrate Theorem 1.1. 


► Example 1.15 

Consider the initial-value problem 


y(i) = 3. 

Let us apply Theorem 1.1. We first check the hypothesis. Here /(x, y) = x 2 + y 2 

and — ~ — = 2 y. Both of the functions / and df/dy are continuous in every domain 

dy 

D of the xy plane. The initial condition j/( 1 ) = 3 means that x 0 = 1 and y 0 = 3, and the 
point (1,3) certainly lies in some such domain D. Thus all hypotheses are satisfied and 
the conclusion holds. That is, there is a unique solution 0 of the differential equation 
dy/dx = x 2 + y 2 , defined on some interval |x — 1 1 < h about x 0 = 1, which satisfies 
that initial condition, that is, which is such that 0(1) = 3. 


► Example 1.16 

Consider the two problems: 

y( i) = 2, 


i = y 

dx 


2. 


dx 

Here 


= 4=, y(0) = 2. 


/(*, _>>) 


y 

x 112 


and 


d/(x, y) 
dy 


1 

i 171 * 


These functions are both continuous except for x = 0 (that is, along the y axis). In 
problem 1, x 0 = 1, y 0 = 2. The square of side 1 centered about (1,2) does not contain 
the y axis, and so both / and df /dy satisfy the required hypotheses in this square. Its 
interior may thus be taken to be the domain D of Theorem 1.1; and (1,2) certainly lies 
within it. Thus the conclusion of Theorem 1.1 applies to problem 1 and we know the 
problem has a unique solution defined in some sufficiently small interval about x 0 = 1. 

Now let us turn to problem 2. Here x 0 = 0, y 0 = 2. At this point neither / nor df/dy 
are continuous. In other words, the point (0, 2) cannot be included in a domain D where 
the required hypotheses are satisfied. Thus we can not conclude from Theorem 1.1 that 
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problem 2 has a solution. We are not saying that it does not have one. Theorem 1.1 
simply gives no information one way or the other. 


Exercises 

1. Show that 

y = 4e 2x + 2e~ 3x 

is a solution of the initial-value problem 

y(0) = 6, 
y'( 0) = 2 . 

Is y = 2e 2x + 4e~ 3x also a solution of this problem? Explain why or why not. 

2. Given that every solution of 

dy 


dx 


+ y = 2xe x 


may be written in the form y = (x 2 + c)e x , for some choice of the arbitrary 
constant c, solve the following initial-value problems: 


dy 

(a) -f - + y = 2xe x , 
dx 

y( o) = 2 . 

3. Given that every solution of 


dy 

(b) ^ + y = 2xe x , 
y(-l) = e + 3. 




may be written in the form 


y = c t e 4x + c 2 e 3x , 


for some choice of the arbitrary constants c x and c 2 , solve the following initial- 
value problems: 


<■> 




y( 0) = 5, 

y'(0) = 6. 

4. Every solution of the differential equation 

d 2 y 


y( 0) = - 2 , 
/(0) = 6. 


dx 


2 + y = 0 



1.3 


INITIAL-VALUE PROBLEMS, BOUNDARY- VALUE PROBLEMS 23 


may be written in the form y = c i sin x + c 2 cos x, for some choice of the arbitrary 
constants c x and c 2 . Using this information, show that boundary problems (a) and 
(b) possess solutions but that (c) does not. 


(a) 

d 2 y 

d^ +y = 0 ’ 

d 2 y 

(b) ^ + y = 0, 


o 

II 

o 

y(0) = i, 


y( n/2) = 1. 

yW) = - 

(c) 

d 2 y 

d^ +y = 0 ’ 



o 

II 

o 



y(n) = 1. 



5. Given that every solution of 


d 3 ' 


dx~ 


3 y - 2 d 2 y dy 

- 3x + 6x t: - 6 y = 0 


dx 2 


dx 


may be written in the form y = c x x + c 2 x 2 + c 3 x 3 for some choice of the arbitrary 
constants c l9 c 2 , and c 3 , solve the initial-value problem consisting of the above 
differential equation plus the three conditions 


y( 2) = o, y'( 2) = 2, y"( 2) = 6. 


6. Apply Theorem 1.1 to show that each of the following initial-value problems has 
a unique solution defined on some sufficiently small inverval |x — 1 1 < h about 

x 0 = 1: 


, v dy 2 . 

(a) — = x 2 sin y, 

y( 1 ) 2. 

7. Consider the initial-value problem 


(b) 


dx x — 2’ 

y( i) = o. 


— = P(x)y 2 + Q(x)y, 

y( 2) = 5, 


where P(x) and Q(x) are both third-degree polynomials in x. Has this problem a 
unique solution on some interval | x — 2 1 < h about x 0 = 2? Explain why or why 
not. 

8. On page 19 we stated that the initial- value problem 


y( 0) = o, 


has infinitely many solutions. 
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(a) Verify that this is indeed the case by showing that 

_ fO, x < c, 

y j[f(x - c)] 3/2 , x>c, 

is a solution of the stated problem for every real number c > 0. 

(b) Carefully graph the solution for which c = 0. Then, using this particular 
graph, also graph the solutions for which c = 1, c = 2, and c = 3. 



= CHAPTER TWO 

First-Order Equations for Which Exact Solutions Are Obtainable 


In this chapter we consider certain basic types of first-order equations for which exact 
solutions may be obtained by definite procedures. The purpose of this chapter is to gain 
ability to recognize these various types and to apply the corresponding methods of 
solutions. Of the types considered here, the so-called exact equations considered in 
Section 2.1 are in a sense the most basic, while the separable equations of Section 2.2 are 
in a sense the “easiest.” The most important, from the point of view of applications, are 
the separable equations of Section 2.2 and the linear equations of Section 2.3. The 
remaining types are of various very special forms, and the correponding methods 
of solution involve various devices. In short, we might describe this chapter as a col- 
lection of special “methods,” “devices,” “tricks,” or “recipes,” in descending order of 
kindness! 


2.1 EXACT DIFFERENTIAL EQUATIONS AND INTEGRATING FACTORS 


A. Standard Forms of First-Order Differential Equations 

The first-order differential equations to be studied in this chapter may be expressed in 
either the derivative form 


j- = fix, y ) 
ax 


(2.1) 


or the differential form 


M(x, y) dx + N(x, y) dy = 0. 


( 2 . 2 ) 

25 
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An equation in one of these forms may readily be written in the other form. For 
example, the equation 

dy x 2 + y 2 

dx x — y 

is of the form (2.1). It may be written 

(x 2 -f y 2 ) dx + {y — x) dy = 0, 
which is of the form (2.2). The equation 

(sin x + y) dx + (x + 3 y) dy = 0, 
which is of the form (2.2), may be written in the form (2.1) as 

dy sin x + y 

dx x T- 3 y ‘ 

In the form (2. 1 ) it is clear from the notation itself that y is regarded as the dependent 
variable and x as the independent one; but in the form (2.2) we may actually regard 
either variable as the dependent one and the other as the independent. However, in 
this text, in all differential equations of the form (2.2) in x and y, we shall regard y 
as dependent and x as independent, unless the contrary is specifically stated. 


B. Exact Differential Equations 
DEFINITION 


Let F be a function of two real variables such that F has continuous first partial 
derivatives in a domain D. The total differential dF of the function F is defined by the 
formula 


for all (x, y) e D. 


dF(x , y) = 


e ^ dx + 


SFjx, y) 

dy 


dy 


► Example 2.1 


Let F be the function of two real variables defined by 

F(x, y) = xy 2 + 2 x 3 y 

for all real (x, y). Then 

dF (x,y) „ 2j _ a , dF(x,y)_^ , - 3 


= y + 6x 2 y, 


= 2xy + 2x 3 , 


dx J dy 

and the total differential dF is defined by 

dF(x , y) = (y 2 + 6x 2 y) dx + (2xy + 2x 3 ) dy 


for all real (x, y). 
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DEFINITION 


The expression 


M(x, y) dx -f N(x, y) dy 


(2.3) 


is called an exact differential in a domain D if there exists a function F of two real vari- 
ables such that this expression equals the total differential dF(x, y) for all (x, y) e D. 
That is, expression (2.3) is an exact differential in D if there exists a function F such 
that 


8F(x, y) 
dx 


= M(x, y) 


and 


8F(x, y) 
dy 


N(x, y) 


for all (x, y) e D. 

If M(x, y) dx + N(x, y) dy is an exact differential, then the differential equation 


M(x, y) dx + N(x, y) dy = 0 
is called an exact differential equation. 


► Example 2.2 

The differential equation 

y 2 dx + 2 xy dy = 0 (2.4) 

is an exact differential equation, since the expression y 2 dx + 2 xy dy is an exact 
differential. Indeed, it is the total differential of the function F defined for all (x, y) 
by F(x, y) = xy 2 , since the coefficient of dx is dF(x , y)/(dx) = y 2 and that of dy is 
dF(x, y)/(dy) = 2 xy. On the other hand, the more simple appearing equation 

y dx + 2x dy = 0, (2.5) 

obtained from (2.4) by dividing through by y, is not exact. 

In Example 2.2 we. stated without hesitation that the differential equation (2.4) is 
exact but the differential equation (2.5) is not. In the case of Equation (2.4), we verified 
our assertion by actually exhibiting the function F of which the expression y 2 dx + 
2xy dy is the total differential. But in the case of Equation (2.5), we did not back up 
our statement by showing that there is no fucntion F such that y dx + 2x dy is its total 
differential. It is clear that we need a simple test to determine whether or not a given 
differential equation is exact. This is given by the following theorem. 


THEOREM 2.1 

Consider the differential equation 

M(x, y) dx + N(x, y) dy = 0, (2.6) 

where M and N have continuous first partial derivatives at all points (x, y) in a rectangular 
domain D. 
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1. If the differential equation (2.6) is exact in D , then 

dM(x, y) _ dN(x , y) 
dy dx 

for all (x, y) e D. 

2. Conversely, if 

dM(x, y) _ dN(x, y) 
dy dx 

for all (x, y) e D, then the differential equation (2.6) is exact in D . . 


(2.7) 


Proof. Part 1 . If the differential equation (2.6) is exact in D, then M'dx + N dy is an 
exact differential in D. By definition of an exact differential, there exists a function F 
such that 


dF(x,y) 

dx 


M(x, y) and 


dF(x, y) 
dy 


=-N(x,y) 


for all (x, y) e D. Then 

d 2 F(x, y) dM(x, y) d 2 F(x,y) dN~(x,y) 

— r — r — = and — - — - — = — 

dy dx dy dx dy dx 


for all (x, y) e D. But, using the continuity of the first partial derivatives of M and N, we 
have 


d 2 F(x,y) _ d 2 F(x, y) 
dy dx dx dy 

and therefore 

dM(x , y) dN(x, y) 

dy dx 

for all (x, y) e D. 

Part 2. This being the converse of Part 1, we start with the hypothesis that 

dM(x , y) dN(x, y) 

dy dx 

for all (x, y) e D, and set out to show that M dx + N dy = 0 is exact in D. This means 
that we must prove that there exists a function F such that 


and 


dF(x, y) 
dx 


= M(x, y) 


(2.8) 


dF(x, y) 
dy 


N(x, y) 


(2.9) 


for all (x, y) e D. We can certainly find some F(x, y) satisfying either (2.8) or (2.9), but 
what about both? Let us assume that F satisfies (2.8) and proceed. Then 

F(x, y) = f Af(x, y) dx + <t>(y). 


(2.10) 
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where J M(x, y) dx indicates a partial integration with respect to x, holding y constant, 
and 0 is an arbitrary function of y only. This 0(y) is needed in (2.10) so that F(x, y) 
given by (2.10) will represent all solutions of (2.8). It corresponds to a constant of inte- 
gration in the “one- variable” case. Differentiating (2.10) partially with respect to y, 
we obtain 


dy dy J 


y) dx + 


Now if (2.9) is to be satisfied, we must have 




M(x, y) dx -f 


# oo 

dy 

#(y) 

dy 


(2.11) 


and hence 


d<t>(y) ... d 


M(x, y) dx. 


Since </> is a function of y only, the derivative dcfr/dy must also be independent of x. That 
is, in order for (2.1 1) to hold, 


N(x, y) — 


dy 


M(x, y) dx 


( 2 . 12 ) 


must be independent of x. 
We shall show that 

d_ 

dx 

We at once have 


N(x 9 y)- T 

dy 


d_ 

dx 


N(x, y) 


dy) 


M(x, y) dx 


Af(x, y) dx 


dN (x, y) 
dx 


= 0 . 


dx dy 


M(x, y) dx. 


If (2.8) and (2.9) are to be satisfied, then using the hypothesis (2.7), we must have 


dx dy 

Thus we obtain 


M(x, y) dx 


d 2 F(x, y) d 2 F(x, y) 


dx dy 


I 


d_ 

dx 


j^AT(x, y) - ^ | M(x, y) J = 


and hence 


d_ 

dx 


(x, y) - ^ | M(x, y) dx J = 


dy dx dy dx 

dN(x , y) d 2 

dx dy dx J 

dN(x , y) 3M(x, y) 


M(x, y) 5x. 


M(x, y) dx 


dx 


dy 


But by hypothesis (2.7), 


dM(x, y) dN(x , y) 


5y 


ax 
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for all (x, y) e D. Thus 


d_ 

dx 




M(x, y) dx 


= 0 


for all (x, y) e D, and so (2.12) is independent of x. Thus we may write 

' dM(x, y) 


<Ky) = 


N(x, y) 


P 


dy 


dx 


dy. 


Substituting this into Equation (2.10), we have 


F(x, y) = J M(x, y) dx + j* N (x, y) - J* — dx J dy. (2. 1 3) 

This F(x, y) thus satisfies both (2.8) and (2.9) for all (x, y) e D , and so M dx + N dy = 0 
is exact in D. Q.E.D. 


Students well versed in the terminology of higher mathematics will recognize that 
Theorem 2. 1 may be stated in the following words: A necessary and sufficient condition 
that Equation (2.6) be exact in D is that condition (2.7) hold for all (x, y) e D. For 
students not so well versed, let us emphasize that condition (2.7), 

d Af(x, y) _ dN(x, y) 
dy dx 

is the criterion for exactness. If (2.7) holds, then (2.6) is exact; if (2.7) does not hold, then 
(2.6) is not exact. 


► Example 2.3 


We apply the exactness criterion (2.7) to Equations (2.4) and (2.5), introduced in 
Example 2.2. For the equation 

y 2 dx + 2xy dy = 0 (2.4) 

we have 


M(x, y) = y 2 , N(x, y) = 2 xy, 

dM(x, y) dN(x, y) 

— 2,1 — 

for all (x, y). Thus Equation (2.4) is exact in every rectangular domain D. On the other 
hand, for the equation 

y dx + 2x dy = 0, (2.5) 


we have 


M(x,y) = y, N(x,y) = 2x, 

dM(x, y) = dN(x, y) 

dy dx 

for all (x, y). Thus Equation (2.5) is not exact in any rectangular domain D. 
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► Example 2.4 

Consider the differential equation 

(2x sin y + y 3 e x ) dx + (x 2 cos y + 3 y 2 e x ) dy = 0. 

Here 


M(x, y) = 2x sin y + y 3 e x , 
N(x, y) = x 2 cos y -f 3y 2 e x , 


dM(x, y) 
dy 


= 2x cos y + 3y 2 e x = 


dN(x, y) 
dx 


in every rectangular domain D. Thus this differential equation is exact in every such 
domain. 


These examples illustrate the use of the test given by (2.7) for determining whether or 
not an equation of the form M(x, y) dx + N(x, y) dy = 0 is exact. It should be observed 
that the equation must be in the standard form M(x, y) dx + N{x, y) dy = 0 in order to 
use the exactness test (2.7). Note this carefully: an equation may be encountered in the 
nonstandard form M(x, y) dx = N(x, y) dy, and in this form the test (2.7) does not 
apply. 


C. The Solution of Exact Differential Equations 


Now that we have a test with which to determine exactness, let us proceed to solve exact 
differential equations. If the equation M(x, y) dx + N(x, y) dy = 0 is exact in a 
rectangular domain D, then there exists a function F such that 

and for all (x, y) e D. 


dx 

Then the equation may be written 


dy 


dF(x, v) , dF(x, y) f 

■ dx H 7 — - dy = 0 or simply dF(x, y) = 0. 


dx 


dy 


The relation F(x, y) = c is obviously a solution of this, where c is an arbitrary constant. 
We summarize this observation in the following theorem. 


THEOREM 2.2 

Suppose the differential equation M(x, y) dx + N(x, y) dy = 0 satisfies the differentia- 
bility requirements of Theorem 2.1 and is exact in a rectangular domain D. Then a one- 
parameter family of solutions of this differential equation is given by F(x , y) = c, where F 
is a function such that 

and for all (x, y) e D. 

dx dy 


and c is an arbitrary constant. 
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Referring to Theorem 2.1, we observe that F(x, y) is given by formula (2.13). 
However, in solving exact differential equations it is neither necessary nor desirable to 
use this formula. Instead one obtains F(x, y) either by proceeding as in the proof of 
Theorem 2.1, Part 2, or by the so-called “method of grouping,” which will be explained 
in the following examples. 


► Example 2.5 


Solve the equation 

(3x 2 -f 4 xy) dx + (2x 2 + 2y) dy = 0. 

Our first duty is to determine whether or not the equation is exact. Here 

M(x, y) = 3x 2 + 4 xy, N(x, y) = 2x 2 + 2 y, 

8M(x, y) = 4x dN(x , y) = ^ 

dy dx 

for all real (x, y), and so the equation is exact in every rectangular domain D. Thus we 
must find F such that 

= M(x, y) = 3x 2 + 4xy and = N(x, y) = 2x 2 + 2y. 

dx dy 

From the first of these, 


Then 


F(x, y) 


1 


M(x, y) dx + <t>(y) = 


= x 3 + 2x 2 y + (j>(y). 


(3x 2 + 4 xy) dx + cj)(y) 


But we must have 


Thus 


dF(x,y)_ ^ 2 d(f>(y) 

dy X dy 


8F(x, y) 
dy 


= N(x, y) = 2x 2 + 2 y. 


lx 2 + 2 y = 2x 2 + 


dy 


or 


d(t>{y) = 

dy 


Thus (l>(y ) = y 2 + c 0 , where c 0 is an arbitrary constant, and so 


F(x, y) = x 3 + 2x 2 y + y 2 + c 0 . 
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Hence a one-parameter family of solution is F(x, y) = c t , or 

x 3 + 2 x 2 y + y 2 + c 0 = c i . 

Combining the constants c 0 and c i we may write this solution as 

x 3 + 2x 2 y + y 2 = c, 

where c = c t — c 0 is an arbitrary constant. The student will observe that there is no loss 
in generality by taking c 0 = 0 and writing <j)(y) = y 2 . We now consider an alternative 
procedure. 


Method of Grouping. We shall now solve the differential equation of this exam- 
ple by grouping the terms in such a way that its left member appears as the sum of 
certain exact differentials. We write the differential equation 

(3x 2 -f 4 xy) dx + (2x 2 + 2y) dy = 0 

in the form 

3x 2 dx + (4 xy dx + 2x 2 dy) + 2 y dy = 0. 

We now recognize this as 

d{x 3 ) + d(2x 2 y) + d(y 2 ) = d(c ), 
where c is an arbitrary constant, or 

d(x 3 + 2x 2 y + y 2 ) = d(c). 

From this we have at once 

x 3 + 2x 2 y + y 2 = c. 

Clearly this procedure is much quicker, but it requires a good “working knowledge” of 
differentials and a certain amount of ingenuity to determine just how the terms should 
be grouped. The standard method may require more “work” and take longer, but it is 
perfectly straightforward. It is recommended for those who like to follow a pattern and 
for those who have a tendency to jump at conculsions. 

Just to make certain that we have both procedures well in hand, we shall consider an 
initial-value problem involving an exact differential equation. 


► Example 2.6 


Solve the initial-value problem 

(2x cos y -I- 3x 2 y) dx + (x 3 — x 2 sin y — y) dy = 0, 

y( 0 ) = 2 . 

We first observe that the equation is exact in every rectangular domain D, since 


dMjx, y) 
dy 


— 2x sin y -f 3x 2 = 


d N(x, y) 
dx 


for all real (x, y). 
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Standard Method. We must find F such that 

dF(x, y) 


and 


Then 


dx 


dF(x, y) 


dy 


F(x,y) 


M(x, y) = 2x cos y + 3 x 2 y 


N(x, y) = x 3 — x 2 sin y — y. 


y) dx + 4>(y) 


= j M(x, 

= j* (2x cos y + 3x 2 y) dx + 4>(y) 
= x 2 cos y + x 3 y + (j)(y ), 


dF(x, y) 2 . 3 d(f)(y) 

= —x 2 sin j; + x 3 +■ 


dy 


dy 


But also 


dF(x,y) 3 2 . 

— = N(x, y) = x — x 1 sin y — y 

dy 


and so 

d<Hy) 

dy 

and hence 


4>{y) = 



Thus 

y 2 

F(x, y) = x 2 cos y + x 3 y - — + c 0 . 

Hence a one-parameter family of solutions is F(x, y) = c 1? which may be expressed as 

2 3 y 2 

x cos y + x y — — = c. 

Applying the initial condition y = 2 when x = 0, we find c = — 2. Thus the solution of 
the given initial-value problem is 

y 2 

x 2 cos y + x 3 y — — = — 2 . 


Method of Grouping. We group the terms as follows: 

(2x cos y dx — x 2 sin y dy) + (3x 2 y dx + x 3 dy) — y dy = 0. 
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Thus we have 

d(x 2 cos y) -f d(x 2 y) — = ^( c )> 

and so 


2 3 y 

x* cos y + x y — — = c 

is a one-parameter family of solutions of the differential equation. Of course the initial 
condition y(0) = 2 again yields the particular solution already obtained. 


D. Integrating Factors 


Given the differential equation 


if 


M(x, y) dx -f N(x, y) dy = 0, 


<9Af(x, y) 8N(x, y) 
dy dx 

then the equation is exact and we can obtain a one-parameter family of solutions by 
one of the procedures explained above. But if 

dM(x , y) dN(x , y) 
dy dx 

then the equation is not exact and the above procedures do not apply. What shall we do 
in such a case? Perhaps we can multiply the nonexact equation by some expression that 
will transform it into an essentially equivalent exact equation. If so, we can proceed to 
solve the resulting exact equation by one of the above procedures. Let us consider 
again the equation 

y dx -f 2x dy = 0, (2.5) 

which was introduced in Example 2.2. In that example we observed that this equation is 
not exact. However, if we multiply Equation (2.5) by y, it is transformed into the 
essentially equivalent equation 

y 2 dx + 2 xy dy = 0, (2.4) 

which is exact (see Example 2.2). Since this resulting exact equation (2.4) is integrable, 
we call y an integrating factor of Equation (2.5). In general, we have the following 
definition: 


DEFINITION 

If the differential equation 


M(x, y) dx + N(x, y) dy = 0 


(2.14) 
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is not exact in a domain D but the differential equation 

ju(x, y)M(x , y) dx + ju(x, y)N(x, y) dy = 0 (2. 1 5) 

is exact in D , then n{x, y) is called an integrating factor of the differential equation {2.14). 


► Example 2.7 


Consider the differential equation 

(3y + 4xy 2 ) dx + (2x + 3 x 2 y) dy = 0. 

This equation is of the form (2.14), where 

M(x, y) = 3y + 4 xy 2 , N(x , y) = 2x + 3 x 2 y, 


dM(x , y) 
dy 


= 3 + 8xy, 


d N(x, y) 
dx 


= 2 + 6 xy. 


(2.16) 


Since 


dM(x , y) dN(x, y) 
dy dx 

except for (x, y) such that 2xy +1=0, Equation (2.16) is not exact in any rectangular 
domain D. 

Let ju(x, y) = x 2 y. Then the corresponding differential equation of the form (2.15) is 


(3x 2 y 2 + 4x 3 y 3 ) dx + (2x 3 y + 3x 4 y 2 ) dy = 0. 
This equation is exact in every rectangular domain D , since 


d[>(x, y)M(x, y)] ^ 2 , ^ 3 2 3[/i(x, y)N{x, y)] 

= 6x y + 12x y = 

dy dx 

for all real (x, y). Hence /i(x, y) = x 2 y is an integrating factor of Equation (2.16). 


Multiplication of a nonexact differential equation by an integrating factor thus 
transforms the nonexact equation into an exact one. We have referred to this result- 
ing exact equation as “essentially equivalent” to the original. This so-called essentially 
equivalent exact equation has the same one-parameter family of solutions as the non- 
exact original. However, the multiplication of the original equation by the integrating 
factor may result in either ( 1 ) the loss of (one or more) solutions of the original, or (2) the 
gain of (one or more) functions which are solutions of the “new” equation but not of the 
original, or (3) both of these phenomena. Hence, whenever we transform a nonexact 
equation into an exact one by multiplication by an integrating factor, we should check 
carefully to determine whether any solutions may have been lost or gained. We shall 
illustrate an important special case of these phenomena when we consider separable 
equations in Section 2.2. See also Exercise 22 at the end of this section. 

The question now arises: How is an integrating factor found? We shall not attempt to 
answer this question at this time. Instead we shall proceed to a study of the important 
class of separable equations in Section 2.2 and linear equations in Section 2.3. We shall 
see that separable equations always possess integrating factors that are perfectly 
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obvious, while linear equations always have integrating factors of a certain special 
form. We shall return to the question raised above in Section 2.4. Our object here has 
been merely to introduce the concept of an integrating factor. 


Exercises 


In Exercises 1-10 determine whether or not each of the given equations is exact; solve 
those that are exact. 

1. (3x + 2 y) dx + (2x + y) dy = 0. 

2. ( y 2 + 3 ) dx + (2 xy — 4) dy = 0. 

3. (2 xy + 1) dx + (x 2 + 4 y) dy = 0. 

4. (3x 2 y + 2) dx — (x 3 + y) dy = 0. 

5. (6xy + 2y 2 — 5) dx + (3x 2 + 4xy — 6) dy = 0. 

6. (6 2 + l)cos r dr + 26 sin r d6 = 0. 

7. (y sec 2 x + sec x tan x) dx -f (tan x + 2y) dy = 0. 

8 . 

9. 


J 2 + x ) dx + ( ^ + y ) dy = 0. 
2s- 1 


10. 


t 

2y w + 1 

vl/2 


ds + 


s - s 4 


dt = 0. 


dx + (3x 1/2 y 1/2 


\) dy = 0. 


Solve the initial-value problems in Exercises 11-16. 

11. (2xy — 3) dx -f (x 2 + Ay) dy = 0, y(l) = 2. 

12. (3x 2 y 2 — y 3 + 2x) dx + (2x 3 y — 3xy 2 + 1) dy = 0, y( — 2) = 1. 

13. (2 y sin x cos x + y 2 sin x) dx + (sin 2 x — 2 y cos x) dy = 5 0 , y(0) = 3. 

14. ( ye x + 2e x + y 2 ) dx + ( e x + 2xy) dy = 0, j/(0) = 6. 


15. 


16. 


3 — y 


dx + 


y 2 — 2x 
xy 2 


dy = 0, 

1/3 


y(— i) = 2 . 


1 + 8xy 2/3 2 x 4/3 y 2/3 -x‘ 

dx + j/3 dy = 0, 


x 2/3 y 1/3 


y(i) — 8. 


17. In each of the following equations determine the constant A such that the 
equation is exact, and solve the resulting exact equation: 

(a) (x 2 + 3xy) dx + ( Ax 2 + 4y) dy = 0. 


(b) 


1 


1 


x 2 + y 2 


dx + 


Ax + 1 


dy = 0. 
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18. In each of the following equations determine the constant A such that the 
equation is exact, and solve the resulting exact equation: 

(a) ( Ax 2 y + 2 y 2 ) dx + (x 3 -f 4xy) dy = 0. 


(b) — + — j )dx + — j )dy — 0 . 


19. In each of the following equations determine the most general function N(x, y) 
such that the equation is exact: 

(a) (x 3 + xy 2 ) dx + N(x, 3 ;) dy = 0. 

(b) (x “ 2 y ~ 2 -F xy " 3 ) dx + N(x, y) dy = 0. 

20. In each of the following equations determine the most general function M(x, y) 
such that the equation is exact: 

(a) M(x, y) dx + ( 2 x 2 y 3 + x 4 y) dy = 0 . 

(b) Af(x, y) dx -f ( 2 ye x + y 2 e 2x ) dy = 0 . 


21. Consider the differential equation 

(4x + 3 y 2 ) dx + 2 xy dy = 0. 

(a) Show that this equation is not exact. 

(b) Find an integrating factor of the form x", where n is a positive integer. 

(c) Multiply the given equation through by the integrating factor found in (b) 
and solve the resulting exact equation. 

22. Consider the differential equation 

(y 2 + 2 xy) dx — x 2 dy = 0 . 

(a) Show that this equation is not exact. 

(b) Multiply the given equation through by y", where n is an integer, and then 
determine n so that y n is an integrating factor of the given equation. 

(c) Multiply the given equation through by the integrating factor found in (b) 
and solve the resulting exact equation. 

(d) Show that y = 0 is a solution of the original nonexact equation but is not a 
solution of the essentially equivalent exact equation found in step (c). 

(e) Graph several integral curves of the original equation, including all those 
whose equations are (or can be written) in some “special” form. 

23. Consider a differential equation of the form 

ly + x f(x 2 + y 2 )] dx + [y/(x 2 + y 2 ) - x] dy = 0 . 

(a) Show that an equation of this form is not exact. 

(b) Show that 1 /(x 2 + y 2 ) is an integrating factor of an equation of this form. 

24. Use the result of Exercise 23(b) to solve the equation 

[y -f x(x 2 + y 2 ) 2 ] dx + [y(x 2 + y 2 ) 2 — x] dy = 0 . 
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2.2 SEPARABLE EQUATIONS AND EQUATIONS REDUCIBLE TO THIS FORM 
A. Separable Equations 

DEFINITION 

An equation of the form 


F(x)G(y) dx + f(x)g(y) dy = 0 (2.17) 

is called an equation with variables separable or simply a separable equation. 

For example, the equation (x — 4 )y A dx — x 3 (y 2 — 3) dy = 0 is a separable equa- 
tion. 

In general the separable equation (2.17) is not exact, but it possesses an obvious 
integrating factor, namely \/f(x)G(y). For if we multiply Equation (2.17) by this 
expression, we separate the variables, reducing (2.17) to the essentially equivalent 
equation 


F(x) 

f(x) 


dx + 


gjy) 

G(y) 


dy = 0. 


(2.18) 


This equation is exact, since 


A PM] = o = — f AzT 

dy |_/( x ) J fo|_G(}0_’ 


Denoting F(x)/f(x) by M(x) and g{y)/G{y) by N(y), Equation (2.18) takes the form 
M(x) dx + N(y) dy = 0. Since M is a function of x only and N is a function of y only, 
we see at once that a one-parameter family of solutions is 

J M(x) dx + J N(y) dy = c, (2.19) 

where c is the arbitrary constant. Thus the problem of finding such a family of solutions 
of the separable equation (2.17) has reduced to that of performing the integrations 
indicated in Equation (2.19). It is in this sense that separable equations are the simplest 
first-order differential equations. 

Since we obtained the separated exact equation (2.18) from the nonexact equation 
(2.17) by multiplying (2.17) by the integrating factor 1 //(x)G(y), solutions may have 
been lost or gained in this process. We now consider this more carefully. In formally 
multiplying by the integrating factor l//(x)G(jz), we actually divided by f(x)G(y). We 
did this under the tacit assumption that neither f(x) nor G(y) is zero; and, under this 
assumption, we proceeded to obtain the one-parameter family of solutions given by 
(2.19). Now, we should investigate the possible loss or gain of solutions that may have 
occurred in this formal process. In particular, regarding y as the dependent variable as 
usual, we consider the situation that occurs if G(y) is zero. Writing the original dif- 
ferential equation (2.17) in the derivative form 


nmy)f x 


+ F(x)G(y) = 0, 
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we immediately note the following: If y 0 is any real number such that G(y 0 ) = 0, then 
y = y 0 is a (constant) solution of the original differential equation; and this solution 
may (or may not) have been lost in the formal separation process. 

In finding a one-parameter family of solutions a separable equation, we shall 
always make the assumption that any factors by which we divide in the formal 
separation process are not zero. Then we must find the solutions y = y 0 of the 
equation G(y) = 0 and determine whether any of these are solutions of the original 
equation which were lost in the formal separation process. 


► Example 2.8 

Solve the equation 

(x — 4 )y 4 dx — x 3 (y 2 — 3) dy = 0. 

The equation is separable; separating the variables by dividing by x 3 y 4 , we obtain 

(x - 4) dx (y 2 — 3) dy 


or 


(x“ 


4x 3 ) dx — (y 2 — 3 y 4 ) dy = 0. 


Integrating, we have the one-parameter family of solutions 

12 11 

1 2 “* 3 = C > 

x x z y y 

where c is the arbitrary constant. 

In dividing by x 3 y 4 in the separation process, we assumed that x 3 # 0 and y 4 # 0. 
We now consider the solution y = 0 of y 4 = 0. It is not a member of the one-parameter 
family of solutions which we obtained. However, writing the original differential 
equation of the problem in the derivative form 

dy (x — 4)y 4 
dx x 3 (y 2 — 3)’ 

it is obvious that y = 0 is a solution of the original equation. We conclude that it is a 
solution which was lost in the separation process. 


► Example 2.9 

Solve the initial-value problem that consists of the differential equation 

x sin y dx + (x 2 + 1) cos y dy = 0 (2.20) 

and the initial condition 



( 2 . 21 ) 


We first obtain a one-parameter family of solutions of the differential equation (2.20). 
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Separating the variables by dividing by (x 2 + 1) sin y, we obtain 


x , cos v , 

dx + — — — dy = 0. 


x 2 + 1 


sin y 


Thus 


x dx f cos v , 

+ \^rr^dy = c 09 


x 2 + 1 J sin y 

where c 0 is an arbitrary constant. Recall that 


I 


— = In \ u\ + C and \u\ = 
u 


u if u > 0, 
-u if u < 0. 


Then, carrying out the integrations, we find 

j In (x 2 + 1) + In | sin y | = c 0 . 


( 2 . 22 ) 


We could leave the family of solutions in this form, but we can put it in a neater form in 
the following way. Since each term of the left member of this equation involves the 
logarithm of a function, it would seem reasonable that something might be accom- 
plished by writing the arbitrary constant c 0 in the form In |c t |. This we do, obtaining 


j ln(x 2 + 1) + In | sin y \ = In |cj. 
Multiplying by 2, we have 

In (x 2 + 1) + 2 In | sin y\ = 2 In \c { \. 

Since 


and 


2 In | sin y\ = In (sin y) 2 , 


2 In \ c l | = In c 2 = In c, 


where 


c = c\ > 0, 


we now have 


In (x 2 + 1) + In sin 2 y = In c. 


Since In A + In B = In AB , this equation may be written 

In (x 2 + l)sin 2 y = In c. 

From this we have at once 

(x 2 + l)sin 2 y = c. 


(2.23) 


Clearly (2.23) is of a neater form than (2.22). 

In dividing by (x 2 + l)sin y in the separation process, we assumed that sin y # 0. 
Now consider the solutions of sin y = 0. These are given by y = nn (n = 0, ± 1, 
±2,...). Writing the original differential equation (2.20) in the derivative form, it is 
clear that each of these solutions y = nn{n = 0, ±1, ± 2, . . .) of sin y = 0 is a constant 
solution of the original differential equation. Now, each of these constant solutions 
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y = nn is a member of the one-parameter family (2.23) of solutions of (2.20) for c = 0. 
Thus none of these solutions was lost in the separation process. 

We now apply the initial condition (2.21) to the family of solutions (2.23). We have 

(l 2 + l)sin 2 j = c 

and so c = 2. Therefore the solution of the initial-value problem under consideration is 

(x 2 + l)sin 2 y = 2. 


B. Homogeneous Equations 

We now consider a class of differential equations that can be reduced to separable 
equations by a change of variables. 

DEFINITION 

The first-order differential equation M (x, y) dx -f N(x, y)dy = 0 is said to be homo- 
geneous ifi when written in the derivative form (dy/dx) = f (x, y), there exists a function g 
such that /(x, y) can be expressed in the form g{y/x). 

► Example 2.10 

The differential equation (x 2 — 3y 2 ) dx -f 2xy dy = 0 is homogeneous. To see this, we 
first write this equation in the derivative form 

dy 3 y 2 — x 2 

dx 2 xy 

Now observing that 

3y 2 — x 2 3y x 3 /y\ 1/ 1 \ 

2xy 2x 2y 2 \x/ 2 \y/x j 

we see that the differential equation under consideration may be written as 

dx 2\x) 2 \y/x/ 

in which the right member is of the form g(y/x) for a certain function g. 

► Example 2.11 

The equation 

(y + y/x 2 + y 2 ) dx — x dy = 0 
is homogeneous. When written in the form 

dy _ y + y/x 2 + y 2 
dx 


x 
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the right member may be expressed as 


lx 1 + y 2 


depending on the sign of x. This is obviously of the form g(y/x). 

Before proceeding to the actual solution of homogeneous equations we shall 
consider slightly different procedure for recognizing such equations. A function F is 
called homogeneous of degree n if F(tx, ty) = t n F(x , y). This means that if tx and ty are 
substituted for x and y, respectively, in F(x, y), and if t n is then factored out, the other 
factor that remains is the original expression F(x, y) itself. For example, the function F 
given by F(x, y) = x 2 -f y 2 is homogeneous of degree 2, since 

F(fx, ty) = (tx) 2 + (ty) 2 = t 2 (x 2 + y 2 ) = t 2 F(x, y). 

Now suppose the functions M and N in the differential equation M(x, y) dx + 
N(x, y) dy = 0 are both homogeneous of the same degree n. Then since M(tx, ty) = 
t n M(x, y), if we let r = 1 /x, we have 




M(x, y). 


Clearly this may be written more simply as 


Ml 1,— 

V x 


and from this we at once obtain 


Likewise, we find 


M(*,y)= - 


N(x , y) = I — 


— M(x,y); 


M K- 




Now writing the differential equation Af(x, y) dx + N(x, y) dy = 0 in the form 

dy = M(x, y) 
dx N(x, y) ’ 

we find 


- M 1 ^ 


- N l.i- 


/V( 1,- 


Clearly the expression on the right is of the form g(y/x ), and so the equation M(x, y) 
dx + N(x, y) dy = 0 is homogeneous in the sense of the original definition of homo- 
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geneity. Thus we conclude that if M and N in M(x, y) dx + N(x, y) dy = 0 are both 
homogeneous functions of the same degree n , then the differential equation is a 
homogeneous differential equation. 

Let us now look back at Examples 2.10 and 2.11 in this light. In Example 2.10, 
M(x, y) = x 2 — 3y 2 and N(x, y) = 2xy. Both M and N are homogeneous of degree 2. 
Thus we know at once that the equation (x 2 — 3y 2 ) dx + 2xy dy = 0 is a homogeneous 
equation. In Example 2.1 1, Af(x, y) = y + *Jx 2 + y 2 and N(x, y) = — x. Clearly N is 
homogeneous of degree 1. Since 

M(tx,ty) =ty + J{tx) 2 + (ty) 2 = t(y + J x 2 + y 2 ) = t l M(x, y), 
we see that M is also homogeneous of degree 1. Thus we conclude that the equation 

(y + ^/x 2 + y 2 ) dx — x dy = 0 

is indeed homogeneous. 

We now show that every homogeneous equation can be reduced to a separable 
equation by proving the following theorem. 


THEOREM 2.3 


if 


Af(x, y) dx + N(x, y) dy = 0 


(2.24) 


is a homogeneous equation , then the change of variables y — vx transforms (2.24) into a 
separable equation in the variables v and x. 


Proof. Since M(x, y) dx + N(x, y) dy = 0 is homogeneous, it may be written in the 
form 


Let y = vx. Then 


and (2.24) becomes 



dy dv 


V + X 


dv 

dx 


g(v) 


or 


[t> — g(t>)] dx + x dv = 0. 

This equation is separable. Separating the variables we obtain 

dv dx 

b — = 0. 

v - g(v) x 


(2.25) 

Q.E.D. 
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Thus to solve a homogeneous differential equation of the form (2.24), we let y = vx 
and transform the homogeneous equation into a separable equation of the form (2.25). 
From this, we have 

C dv f dx 

J «> - g(v) + J x " c ’ 

where c is an arbitrary constant. Letting F(v) denote 

f dv 

J v - g(v) 

and returning to the original dependent variable y, the solution takes the form 


F 



+ In | x | = c. 


► Example 2.1 2 

Solve the equation 

(x 2 — 3y 2 ) dx + 2 xy dy = 0. 

We have already observed that this equation is homogeneous. Writing it in the form 

dy_ = __x 3 y_ 
dx 2 y 2x 


and letting y = vx , we obtain 


or 


or, finally, 


dv 1 3v 
V + X T x --Tv + ~2 


dv 1 v 

X dx 2v + 2’ 


dv v 2 — 1 


dx 2v 

This equation is separable. Separating the variables, we obtain 

2v dv dx 


Integrating, we find 


and hence 


v 2 — 1 x 


ln|t> 2 - 1 1 = In | x | + In |c|, 


\v 2 -\\ = \cx\. 
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where c is an arbitrary constant. The reader should observe that no solutions were lost 
in the separation process. Now, replacing v by y/x we obtain the solutions in the 
form 


or 


| y 2 - x 2 1 = |cx|x 2 . 

If y > x > 0, then this may be expressed somewhat more simply as 

2 2 3 

y — x = cx . 


► Example 2.13 

Solve the initial-value problem 

(y + y/x 2 + y 2 ) dx - xdy = 0, 

y ( 1) = 0. 

We have seen that the differential equation is homogeneous. As before, we write it in the 
form 

dy _ y 4- y/x 2 + y 2 
dx x 

Since the initial x value is 1, we consider x > 0 and take x = y/x 2 and obtain 

dy_ = i 

dx x 



We let y = vx and obtain 


dv 

V + x — = V 
dx 


+ yrr^ 


or 



yrr^. 


Separating variables, we find 


dv dx 
+ 1 “ X ' 

Using tables, we perform the required integrations to obtain 
In 1 1 ? + y/v 2 + 1 1 = In | x | + In |c|, 


or 


V + y/v 2 + 1 = CX. 
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Now replacing v by y/x, we obtain the general solution of the differential equation in 
the form 



y + y/x 2 + y 2 = cx 2 . 

The initial condition requires that y = 0 when x = 1. This gives c = 1 and hence 

y + v/x 2 + y 2 = x 2 , 

from which it follows that 

y = i(x 2 - i). 


Exercises 


Solve each of the differential equations in Exercises 1-14. 

1. 4 xy dx + (x 2 + 1 ) dy = 0. 

2. (xy + 2x + y + 2) dx + (x 2 + 2x) dy = 0. 

3. 2r(s 2 + 1 ) dr + (r 4 + 1) ds = 0. 

4. esc y dx + sec x dy = 0. 

5. tan 6 dr + 2r d6 = 0. 

6. ( e v + l)cos u du + e"(sin u + 1) dv = 0. 

7. (x + 4)(y 2 + 1) dx + y(x 2 + 3x + 2) dy = 0. 

8. (x + y) dx — x dy = 0. 

9. (2xy + 3y 2 ) dx — (2 xy + x 2 ) dy = 0. 

10. v 3 du + (w 3 — uv 2 ) dv = 0. 

y 


11 . 


x tan — + y ) dx — x dy = 0. 


12. (2s 2 + 1st + t 2 ) ds + (s 2 + 2 st — t 2 ) dt = 0. 

13. (x 3 + y 2 y/x 2 + y 2 ) dx - xy Jx 2 + y 2 dy = 0. 


14. (Jx + y + Jx - y) dx + (y/x - y - Jx + y) dy = 0. 
Solve the initial-value problems in Exercises 15-20. 

15. (y + 2) dx + y(x + 4) dy = 0, y( — 3) = — 1 . 


y (r2 


16. 8 cos 2 y dx + esc 2 x dy = 0, 

17. (3x + 8)(y 2 + 4) dx — 4 y(x 2 + 5x + 6) dy = 0, 


71 

4* 


y( i) = 2. 
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18. (x 2 4- 3y 2 ) dx - 2xy dy = 0, y( 2) = 6. 

19. (2x — 5>;) dx -f (4x — y)dy = 0, y(l) = 4. 

20. (3x 2 4- 9 xy 4- 5y 2 ) dx - (6x 2 4 4 xy) dy = 0, y( 2) = -6. 

21. (a) Show that the homogeneous equation 

(Ax 4 By) dx 4- (Cx 4 Dy) dy = 0 

is exact if and only if B = C. 

(b) Show that the homogeneous equation 

(Ax 2 4 Bxy 4- Cy 2 ) dx 4 (Dx 2 4- Exy 4- Fy 2 ) dy = 0 

is exact if and only if B = 2D and E = 2 C. 

22. Solve each of the following by two methods (see Exercise 21(a)): 

(a) (x 4- 2 y) dx 4- (2x - y)dy = 0. 

(b) (3x — y) dx — (x 4 y) dy = 0. 

23. Solve each of the following by two methods (see Exercise 21(b)): 

(a) (x 2 4- 2y 2 ) dx 4- (4xy — y 2 ) dy = 0. 

(b) (2x 2 + 2xy 4 y 2 ) dx 4- (x 2 4 2xy) dy = 0. 

24. (a) Prove that if M dx + N dy = 0 is a homogeneous equation, then the change 

of variables x = uy transforms this equation into a separable equation in the 
variables u and x. 

(b) Use the result of (a) to solve the equation of Example 2.12 of the text. 

(c) Use the result of (a) to solve the equation of Example 2.13 of the text. 

25. Suppose the equation M dx 4- N dy = 0 is homogeneous. Show that the trans- 
formation x = r cos 0, y = r sin 0 reduces this equation to a separable equation in 
the variables r and 6. 

26. (a) Use the method of Exercise 25 to solve Exercise 8. 

(b) Use the method of Exercise 25 to solve Exercise 9. 

27. Suppose the equation 

Mdx + Ndy = 0 (A) 

is homogeneous. 

(a) Show that Equation (A) is invariant under the transformation 

x = k£, y = kr\ , (B) 

where k is a constant. 

(b) Show that the general solution of Equation (A) can be written in the form 

X = C(/)^J, (C) 


where c is an arbitrary constant. 



2.3 LINEAR EQUATIONS AND BERNOULLI EQUATIONS 


49 


(c) Use the result of (b) to show that the solution (C) is also invariant under the 
transformation (B). 

(d) Interpret geometrically the results proved in (a) and (c). 

2.3 LINEAR EQUATIONS AND BERNOULLI EQUATIONS 
A. Linear Equations 

In Chapter 1 we gave the definition of the linear ordinary differential equation of order 
n ; we now consider the linear ordinary differential equation of the first order. 


DEFINITION 


A first-order ordinary differential equation is linear in the dependent variable y and the 
independent variable x if it is, or can be, written in the form 


For example, the equation 


dy 

^ + P(x)y = Q(x). 


dy , 

X -f- + (x + \)y = x 3 
dx 


is a first-order linear differential equation, for it can be written as 


dy 


which is of the form (2.26) with P(x) = 1 + (1/x) and Q(x) = x 2 . 
Let us write Equation (2.26) in the form 

[P(x)y — Q(x)] dx + dy = 0. 

Equation (2.27) is of the form 

M(x, y) dx -f JV(x, y) dy = 0, 

where 


(2.26) 


(2.27) 


Since 


Af(x, y) = P(x)y — Q(x) and N(x, y) = 1. 


dM(x, y) 
dy 


= P(x) and 


dN(x, y) 
dx 


Equation (2.27) is not exact unless P(x) = 0, in which case Equation (2.26) degenerates 
into a simple separable equation. However, Equation (2.27) possesses an integrating 
factor that depends on x only and may easily be found. Let us proceed to find it. Let us 
multiply Equation (2.27) by p(x), obtaining 

[>(x)P(x)y - ju(x)g(x)] dx + p{x) dy = 0. 


(2.28) 
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By definition, / i(x ) is an integrating factor of Equation (2.28) if and only if Equation 
(2.28) is exact; that is, if and only if 

— ln(x)P(x)y - //(x)G(x)] = — ln(x)l 

This condition reduces to 

H(x)P(x) = |>(x)]. (2.29) 

In (2.29); P is a known function of the independent variable x, but fi is an unknown 
function of x that we are trying to determine. Thus we write (2.29) as the differential 
equation 

"'’ w = % 

in the dependent variable n and the independent variable x, where P is a known 
function of x. This differential equation is separable; separating the variables, we have 

— = P(x) dx. 

P 

Integrating, we obtain the particular solution 


In \n\ = 


P(x) dx 


or 

fi = e 5 Pix) dx , (2.30) 

where it is clear that /i > 0. Thus the linear equation (2.26) possesses an integrating 
factor of the form (2.30). Multiplying (2.26) by (2.30) gives 

e IP(x)dx + e !P(x)dx p^ y = Q( x ) e SP(xUX' 


which is precisely 


d_ 

dx 


| ypwdxy] = Q( x ) e SP{x)dx . 


Integrating this we obtain the solution of Equation (2.26) in the form 

where c is an arbitrary constant. 

Summarizing this discussion, we have the following theorem: 


THEOREM 2.4 

The linear differential equation 

T- + P My = ew (2-26) 
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has an integrating factor of the form 


0 iP(x)dx 


A one-parameter family of solutions of this equation is 

ye ,-^.^ e ,n^ mdx + c , 


that is , 


_ -JP(x)dx JP(x)dx 


y = e 


U* 


x Q(x) dx + c 


Furthermore, it can be shown that this one-parameter family of solutions of the linear 
equation (2.26) includes all solutions of (2.26). 

We consider several examples. 


► Example 2.14 


Here 


dy f2x -f 1 

dx \ x 


+ I — | y = e 2x . 


(2.31) 


P(x) = 


2x + 1 


and hence an integrating factor is 


exp 


P(x) dx = exp J = ex P(^ x + l n \ x 


I) 


= exp(2x) exp(ln |x|) = x exp(2x).* 

Multiplying Equation (2.31) through by this integrating factor, we obtain 

o2x d f_ , n 2x( 


xe 2x — + e 2x (2x + l)y = x 
dx 


or 


dx 

Integrating, we obtain the solutions 


(xe 2x y) = x. 


or 


9 v 

xe y = — + c 


y = jxe 2x + — e 2x , 


where c is an arbitrary constant. 


The expressions e x and exp x are identical. 
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► Example 2.15 

Solve the initial-value problem that consists of the differential equation 

(x 2 + 1) 7^ + 4 xy = x (2.32) 

ax 

and the initial condition 

y( 2)= 1. (2.33) 

The differential equation (2.32) is not in the form (2.26). We therefore divide by x 2 + 1 
to obtain 


dy 4x 

h — r y ■ 


dx ' x 2 + 1 y x 2 + r 
Equation (2.34) is in the standard form (2.26), where 

4x 


P(x) = 


x 2 + r 


An integrating factor is 


exp 


P(x) dx 


= exp 


f 4x dx\ 
}x 2 + l) 


exp[ln(x 2 + l) 2 ] = (x 2 + 1) 


Multiplying Equation (2.34) through by this integrating factor, we have 

(x 2 + 1 ) 2< t~ + 4x(x 2 + 1 )y = x(x 2 + 1) 
dx 


(2.34) 


or 

4-l(x 2 + l) 2 y] = x 3 + x. 
dx 

We now integrate to obtain a one-parameter family of solutions of Equation (2.23) in 
the form 

/ 2 ix2 * 2 

(X 2 + 1 ) 2 y = -J- + y + c. 

Applying the initial condition (2.33), we have 

25 = 6 + c. 

Thus c = 19 and the solution of the initial-value problem under consideration is 

(X 2 + l) 2 y = ^ + y+ 19. 


► Example 2.16 

Consider the differential equation 

y 2 dx + (3 xy — 1) dy = 0. 


(2.35) 
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Solving for dy/dx, this becomes 

dy = y 2 
dx 1 — 3 xy ’ 

which is clearly not linear in y. Also, Equation (2.35) is not exact, separable, or 
homogeneous. It appears to be of a type that we have not yet encountered; but let us 
look a little closer. In Section 2.1, we pointed out that in the differential form of a first- 
order differential equation the roles of x and y are interchangeable, in the sense that 
either variable may be regarded as the dependent variable and the other as the 
independent variable. Considering differential equation (2.35) with this in mind, let us 
now regard x as the dependent variable and y as the independent variable. With this 
interpretation, we now write (2.35) in the derivative form 

dx 1 — 3xy 
dy y 2 


or 


dx 3 1 

~A ^ X = — j • 

dy y r 


(2.36) 


Now observe that Equation (2.36) is of the form 


— + P(y)x = Q(y) 
dy 


and so is linear in x. Thus the theory developed in this section may be applied to 
Equation (2.36) merely by interchanging the roles played by x and y. Thus an inte- 
grating factor is 


exp 


P(y) dy 



\y\ 3 ) = y 3 . 


Multiplying (2.36) by y 3 we obtain 


y 



3 y 2 x = y 


or 


~ [y 3 x] = y. 

dy 

Integrating, we find the solutions in the form 

3 .V 2 

y x = — + c 
or 

1 c 


where c is an arbitrary constant. 
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B. Bernoulli Equations 

We now consider a rather special type of equation that can be reduced to a linear 
equation by an appropriate transformation. This is the so-called Bernoulli equation. 


DEFINITION 

An equation of the form 

+ P(x)y = Q(x)y" (2.37) 

is called a Bernoulli differential equation. 

We observe that if n = 0 or 1, then the Bernoulli equation (2.37) is actually a linear 
equation and is therefore readily solvable as such. However, in the general case in which 
n # 0 or 1, this simple situation does not hold and we must proceed in a different 
manner. We now state and prove Theorem 2.5, which gives a method of solution in the 
general case. 


THEOREM 2.5 

Suppose n # 0 or 1. Then the transformation v = y l ~ n reduces the Bernoulli equation 

+ P{x)y = Q(x)y" (2.37) 


to a linear equation in v. 


Proof. We first multiply Equation (2.37) by y ”, thereby expressing it in the 
equivalent form 


y- n ~ + p (x)y l ~ n = Q(x). 


(2.38) 


If we let v = y l ”, then 



- n)y~ n 


dy_ 

dx 


and Equation (2.38) transforms into 

T ±-£ + PMc = eM 

or, equivalently, 

+ n)P(x)v = (1 - n)Q(x). 


Letting 


Pi(x) = (1 - n)P(x) 
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and 


this may be written 


which is linear in v. 


Q iW = (1 - n)Q(x), 

dv 

— + Pi(x)v = Qy(x). 


Q.E.D. 


► Example 2.1 7 


dy 3 

d^ +, - xy 


(2.39) 


This is a Bernoulli differential equation, where n = 3. We first multiply the equation 
through by y ~ 3 , thereby expressing it in the equivalent form 

_ 3 dy _ 2 
y + , 2 = x. 

If we let v = y l ~ n = y ~ 2 , then dv/dx = —2 y~ 3 (dy/dx) and the preceding differential 
equation transforms into the linear equation 

1 dv 


2 dx 


-f v = x. 


Writing this linear equation in the standard form 

^-2„=-2x, 

dx 


(2.40) 


we see that an integrating factor for this equation is 
Multiplying (2.40) by e ~ 2x , we find 


e SP(x)dx _ e ~S2dx _ e ~2x 


dv 


e ' 2x 2e 2x v — —2xe~ 

dx 


2x 


or 


A( e -2*t,)= -2xe~ 2x . 
dx 


Integrating, we find 


e 2x v = \e 2x (2x + 1) + c, 
v = x + i + ce 2x , 



where c is an arbitrary constant. But 
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Thus we obtain the solutions of (2.39) in the form 


- = x + j + ce 2x . 


Note. Consider the equation 


df{y)dy 


+ P(x)f(y) = Q(x), 


where / is a known function of y. Letting v = f(y),v/Q have 


and Equation (2.41) becomes 


dv dv dy df(y) dy 
dx dy dx dy dx ’ 


+ P(x)v = Q(x ), 


which is linear in v. We now observe that the Bernoulli differential equation (2.37) is a 
special case of Equation (2.41). Writing (2.37) in the form 

y- n j^ + p (x)y l - n = Q(x) 

and then multiplying through by (1 - n ), we have 

(l-n)y- n ^ + P l (x)y l - n = Q 1 (x), 

where P x (x) = (1 - n)P(x) and Q^x) = (1 - n)Q(x). This is of the form (2.41), where 
f(y) = y 1 letting v = y 1 it becomes 

dv 

— + P t (x) v = Q t (x), 

which is linear in v. For other special cases of (2.41), see Exercise 37. 


Exercises 


Solve the given differential equations in Exercises 1-18. 

1. ^ + ^-6*’. 2. x 4 y-+ 2x 3 y = 1. 

dx x dx 


3. -y- + 3y = 3x 2 e~ 3x . 
dx 

dx x 1 

5. — 1 j = ~2 * 

dt t 2 t 2 

dy 2x + 1 , 

7. x/ + -— T y = x- 1. 

dx x + 1 


4. -f- + 4 xy = 8x. 

dx 


6. (u 2 + 1) — + 4mi; = 3 u. 
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8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 
17. 


(x 2 + x — 2)-j- + 3(x + 1)>» = x — 1. 
ax 

x dy + (xy + y — 1) dx = 0. 
y dx + (xy 2 + x — y)dy = 0. 

^ + r tan 6 = cos 6. 


cos 6 dr + (r sin 0 — cos 4 9) dd = 0. 

(cos 2 x — y cos x) dx — (1 + sin x) dy = 0. 
(y sin 2x — cos x) dx + (1 + sin 2 x) dy = 0. 

* 1 - 1 = 16 . 
dx x x 

dy -f (Ay — 8 y~ 3 )x dx = 0. 18. 


x^~ + y = -2xV- 

dx 

dx t - hi t - hi 

-7- + — — X = . 

dt It xt 


Solve the initial-value problems in Exercises 19-30. 

19. x~~2y = 2x\ y(2) = 8. 

dx 

20. ^ + 3x 2 y = x 2 , y(0) = 2. 


21. e x ly - 3(e x + l) 2 ] dx + (e x + 1) dy = 0, y(0) = 4. 

22. 2x(y + 1) dx - (x 2 + 1) dy = 0, y(l) = - 5. 

23. ^ + r tan 6 = cos 2 6, = 1. 


24. 


— x = sin It, 

dt 


x(0) = 0. 


25. ^ + A = ^ 

dx 2x v 3 


26. x ^ + y = (xy) 312 , 

dx 


y( i) = 2. 

y(i) = 4. 


27 ^ + y = /M - where /(x) = {a °>r'’ ,,(0, = 0 ' 

28 where /w= {i! ° > w* 10 , ’ (0, * 6 ' 

22 % Jry ~ m where /M= {‘-'. ltl <2 ’ m -'- 

30- (x + 2)g + y = f(x), where f(x ) = j^*’ ^ > * < ^ J^ 0 ) = 4 - 
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31. Consider the equation a(dy/dx) + by = ke Ax , where a , b , and k are positive 

constants and X is a nonnegative constant. 

(a) Solve this equation. 

(b) Show that if X = 0 every solution approaches k/b as x -► oo, but if X > 0 
every solution approaches 0 as x oo. 

32. Consider the differential equation 

% + = O' 

(a) Show that if / and g are two solutions of this equation and c i and c 2 are 
arbitrary constants, then c l f + c 2 g is also a solution of this equation. 

(b) Extending the result of (a), show that if f x , / 2 , ...,/„ are n solutions of this 
equation and c x , c 2 , . . . , c n are n arbitrary constants, then 

Yj c kfk 


33. 


is also a solution of this equation.. 
Consider the differential equation 

»• 


(A) 


where P is continuous on a real interval I. 

(a) Show that the function / such that f(x) = 0 for all x e / is a solution of this 
equation. 

(b) Show that if / is a solution of (A) such that /(x 0 ) = 0 for some x 0 e /, then 
/(x) = 0 for all x 6 /. 

(c) Show that if / and g are two solutions of (A) such that /(x 0 ) = g(x 0 ) for some 
x 0 g /, then /(x) = g(x) for all xe I. 


34. (a) Prove that if / and g are two different solutions of 

dy 


— + P(x)y = Q(x), 


(A) 


then / — g is a solution of the equation 

dy 


dx 


+ P(x)y = 0. 


(b) Thus show that if / and g are two different solutions of Equation (A) and c is 
an arbitrary constant, then 

c{f -g) + f 

is a one-parameter family of solutions of (A). 

35. (a) Let f x be a solution of 
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and f 2 be a solution of 


— + P(x)y = Q 2 (x) 9 

where P, Q l9 and Q 2 are all defined on the same real interval /. Prove that 
f\ + Si ls a solution of 


dy_ 

dx 


+ P(x)y = Q t (x) + Q 2 (x) 


on /. 

(b) Use the result of (a) to solve the equation 

dy 

— 4- y = 2 sin x + 5 sin 2x. 
dx 

36. (a) Extend the result of Exercise 35(a) to cover the case of the equation 

% + P(x)y = Z Qk(x), 

dx k = i 

where P,Q k (k = 1, 2, . . . , n) are all defined on the same real interval /. 
(b) Use the result obtained in (a) to solve the equation 

dy i 

- + , = ( £ sm tx. 

37. Solve each of the following equations of the form (2.41): 

, , dy 1 . 

(a) cos y - — i — sin y = 1. 

dx x 

(b) {y + l ^ + x (y 2 + 2 y) = x - 


38. The equation 


d y 

dx 


= A(x)y 2 + B(x)y + C(x) 


(A) 


is called RiccatVs equation. 

(a) Show that if /l(x) = 0 for all x, then Equation (A) is a linear equation, 
whereas if C(x) = 0 for all x, then Equation (A) is a Bernoulli equation. 

(b) Show that if / is any solution of Equation (A), then the transformation 

y = f + - 

V 

reduces (A) to a linear equation in v. 

In each of Exercises 39-41, use the result of Exercise 38(b) and the given solution to 
find a one-parameter family of solutions of the given Riccati equation: 

39. = (1 — x)y 2 + (2x — l)y — x; given solution /(x) = 1. 

dx 
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y 2 + xy + 1; given solution /(x) = x. 

8 xy 2 + 4x(4x + 1 )y — (8x 3 + 4x 2 - 1); given solution /(x) = x. 

Exercises: Miscellaneous Review 

Solve each of the differential equations in Exercises 1-14. Several can be solved by at 
least two different methods. 

1. 6 x 2 y dx — (x 3 + 1) dy = 0. 

2. (3x 2 y 2 — x)dy + (2xy 3 — y) dx = 0. 

3. (y — 1) dx + x(x + 1) dy = 0. 

4. (x 2 — 2 y) dx — x dy — 0. 

5. (3x — 5y) dx + (x + y) dy = 0. 

6. e 2x y 2 dx + ( e 2x y — 2 y) dy = 0. 

7. (8x 3 y - 12x 3 ) dx + (x 4 + 1) dy = 0. 

8. (2x 2 + xy + y 2 ) dx + 2x 2 dy = 0. 

^ dy _ 4x 3 y 2 - 3x 2 y 
dx x 3 — 2x 4 y 

10. (x + 1) ^ + xy = e~ x . 
dx 

dy = 2x - ly 
dx 3y — 8x ’ 

12. x 2 ^ + xy = xy 3 . 

dx 

13. (x 3 + \)^~ + 6x 2 y = 6x 2 . 

dx 

14 dy = 2x 2 + y 2 
dx 2 xy — x 2 

Solve the initial value problems in Exercises 15-24. 

15. (x 2 + y 2 ) dx - 2 xy dy = 0, y(l) = 2. 

16. 2 (y 2 + 4) dx + (1 - x 2 )y dy = 0, y( 3) = 0. 

17. (e 2x y 2 — 2x) dx + e 2x y dy = 0, y( 0) = 2. 

18. (3x 2 + 2xy 2 ) dx + (2x 2 y + 6 y 2 ) dy = 0, y(l) = 2. 

19. 4 xy = y 2 + 1, y(2) = 1. 
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20 . 

21 . 


dx 

dx 


2x + ly 
2x — 2 y 

xy 


+ 1 ’ 


yd) = 2. 
y(V i5) = 2 . 


2Z f + ^ > 2, < 2 >,(0) = a 

23. (x + 2) + y = /(*), where f(x) = j 2 *’ 0 ^ - 2 > j,(0) = 4 . 

ax [ 4, x > 2, 


24. 



y(i) = 1. 


2.4 SPECIAL INTEGRATING FACTORS AND TRANSFORMATIONS 

We have thus far encountered five distinct types of first-order equations for which 
solutions may be obtained by exact methods, namely, exact, separable, homogeneous, 
linear, and Bernoulli equations. In the case of exact equations, we follow a definite 
procedure to directly obtain solutions. For the other four types definite procedures for 
solution are also available, but in these cases the procedures are actually not quite so 
direct. In the cases of both separable and linear equations we actually multiply by 
appropriate integrating factors that reduce the given equations to equations that are of 
the more basic exact type. For both homogeneous and Bernoulli equations we make 
appropriate transformations that reduce such equations to equations that are of the 
more basic separable and linear types, respectively. 

This suggests two general plans of attack to be used in solving a differential equation 
that is not of one of the five types mentioned. Either (1) we might multiply the given 
equation by an appropriate integrating factor and directly reduce it to an exact 
equation, or (2) we might make an appropriate transformation that will reduce the 
given equation to an equation of some more basic type (say, one of the five types 
already studied). Unfortunately no general directions can be given for finding an 
appropriate integrating factor or transformation in all cases. However, there is a 
variety of special types of equations that either possess special types of integrating 
factors or to which special transformations may be applied. We shall consider a few of 
these in this section. Since these types are relatively unimportant, in most cases we shall 
simply state the relevant theorem and leave the proof to the exercises. 


A. Finding Integrating Factors 

The so-called separable equations considered in Section 2.2 always possess integrating 
factors that may be determined by immediate inspection. While it is true that some 
nonseparable equations also possess integrating factors that may be determined “by 
inspection,” such equations are rarely encountered except in differential equations texts 
on pages devoted to an exposition of this dubious “method.” Even then a considerable 
amount of knowledge and skill are often required. 
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Let us attempt to attack the problem more systematically. Suppose the equation 

M(x, y) dx + N(x 9 y)dy = 0 (2.42) 

is not exact and that /i(x, y) is an integrating factor of it. Then the equation 

/i(x, y)M(x , y) dx + ju(x, y)N (x, y) dy = 0 (2.43) 

is exact. Now using the criterion (2.7) for exactness, Equation (2.43) is exact if and only if 

Y y !>(*> y)M(x, y)] = |>(x, y)N (x, y)]. 


This condition reduces to 

ox dy L oy dx 


H(x, y). 


Here M and N are known functions of x and y, but ju is an unknown function of x and y 
that we are trying to determine. Thus we write the preceding condition in the form 


N(x, y) 


M(x, y) 


dfi <3Af(x, y) dN(x, y) 


Hence // is an integrating factor of the differential equation (2.42) if and only if it is a 
solution of the differential equation (2.44). Equation (2.44) is a partial differential 
equation for the general integrating factor /i, and we are in no position to attempt to 
solve such an equation. Let us instead attempt to determine integrating factors of 
certain special types. But what special types might we consider? Let us recall that the 
linear differential equation 

~ + P(*)y = Q(x) 

always possesses the integrating factor e JP(x)dx , which depends only upon x. Perhaps 
other equations also have integrating factors that depend only upon x. We therefore 
multiply Equation (2.42) by /i(x), where fi depends upon x alone. We obtain 

/z(x)M(x, y) dx + //(x)jV(x, y) dy = 0. 

This is exact if and only if 

— [ju(x)M(x, y)] = — [n(x)N(x, y)]. 

Now M and N are known functions of both x and y, but here the intergrating factor fi 
depends only upon x. Thus the above condition reduces to 

3Af(x, y) dN(x, y) d/i(x) 

—eT ~ " (x) + N ^- y) ST 


i r 

~ N(x, y) | 


dM(x, y) dN(x, y) 
dy dx 
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If 

1 dM(x,y) dN(x,y) 1 
N(x,y)[_ dy dx J 

involves the variable y, this equation then involves two dependent variables and we 
again have difficulties. However, if 

1 gAfjfc y) dN(x,y) 1 

N(x, y)[_ dy dx J 

depends upon x only, Equation (2.45) is a separated ordinary equation in the single 
independent variable x and the single dependent variable p. In this case we may 
integrate to obtain the integrating factor 

1 I” 8M (x, y) dN(x, y) . 

N(x, y) |_ dy dx 1 * 

In like manner, if 

1 T dN(x y y) dM(x, y) 

M(x, y) |_ dx dy 

depends upon y only, then we may obtain an integrating factor that depends only on y. 
We summarize these observations in the following theorem. 


Mx) = exp 


THEOREM 2.6 

Consider the differential equation 

M(x, y) dx + N(x, y) dy = 0. 

If 


depends upon x only , then 


depends upon y only , then 
ex p< 

is an integrating factor of Equation (2,42). 


1 

dM(x , y) 

dN(x, >’)1 

N(x, y) _ 

dy 

dx J 

f 1 

1" dM(x, y) 

dN(x, y) 

J N(x,y) 

l dy 

dx 

Equation (2.42). If 


i 

8N(x, y) 

dM(x, y)l 

Af(x, y) 

dx 

dy i 

f 1 

r 8N(x, y ) 

dM(x, y) 

J M(x, y) 

|_ dx 

dy 


dy 


(2.42) 


(2.46) 


(2.47) 


(2.48) 


(2.49) 


We emphasize that, given a differential equation, we have no assurance in general 
that either of these procedures will apply. It may well turn out that (2.46) involves y 



64 FIRST-ORDER EQUATIONS FOR WHICH EXACT SOLUTIONS ARE OBTAINABLE 


and (2.48) involves x for the differential equation under consideration. Then we must 
seek other procedures. However, since the calculation of the expressions (2.46) and 
(2.48) is generally quite simple, it is often worthwhile to calculate them before trying 
something more complicated. 


► Example 2.18 


Consider the differential equation 

(2x 2 + y) dx + (x 2 y — x) dy = 0. (2.50) 

Let us first observe that this equation is not exact, separable, homogeneous, linear, 
or Bernoulli. Let us then see if Theorem 2.6 applies. Here M(x, y) = 2x 2 + y, and 
N(x, y) = x 2 y — x, and the expression (2.46) becomes 


x z y — x 

This depends upon x only, and so 


x(x); - 1) 


2 

X* 


exp (-JI' 


dx ) = exp(-2 In | x | ) = - 


is an integrating factor of Equation (2.50). Multiplying (2.50) by this integrating factor, 
we obtain the equation 


(2 + -^jdx + (y-^dy = 0. (2.51) 

The student may readily verify that Equation (2.5 1) is indeed exact and that the solution 
is 



c. 


More and more specialized results concerning particular types of integrating factors 
corresponding to particular types of equations are known. However, instead of going 
into such special cases we shall now proceed to investigate certain useful 
transformations. 


B. A Special Transformation 

We have already made use of transformations in reducing both homogeneous and 
Bernoulli equations to more tractable types. Another type of equation that can be 
reduced to a more basic type by means of a suitable transformation is an equation of 
the form 

(a x x -\-b l y + c x ) dx + ( a 2 x + b 2 y + c 2 ) dy = 0. 

We state the following theorem concerning this equation. 
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THEOREM 2.7 

Consider the equation 

(a t x + b l y + c t ) dx + ( a 2 x + b 2 y + c 2 ) dy = 0, 
where a l9 b l9 c l9 a 2 , b 2 , and c 2 are constants . 

Case 1. If a 2 la i # b 2 /b 1? then the transformation 

x = X + h 9 
y = Y + K 

where (h, k) is the solution of the system 

a±h -|- b±k -|“ Ci = 0 , 
a 2 h + b 2 k + c 2 = 0, 
reduces Equation (2.52) to the homogeneous equation 

(a t X + b l Y) dX + (a 2 X + b 2 Y)dY = 0 
in the variables X and Y. 

Case 2. If a 2 /a t = b 2 /b l = /c, then the transformation z = a x x + b t 
equation (2.52) to a separable equation in the variables x and z. 

Examples 2.19 and 2.20 illustrate the two cases of this theorem. 

► Example 2.19 

(x — 2y + 1) dx + (4x — 3y — 6) dy = 0. 

Here a t = 1, b x = — 2, a 2 = 4, b 2 = —3, and so 

a 2 b 2 3 a 2 

a i b i 2 a i 

Therefore this is Case 1 of Theorem 2.7. We make the transformation 

x = X + h, 

y=Y + k, 

where ( h , k) is the solution of the system 

h — 2k + 1 = 0, 

4h — 3k — 6 = 0. 

The solution of this system is h = 3, k = 2, and so the transformation is 

x = X + 3, 

y=Y + 2. 


(2.52) 


y reduces the 


(2.53) 
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This reduces Equation (2.53) to the homogeneous equation 

(X - 2 Y) dX + {AX - 3 Y) dY = 0. (2.54) 

Now following the procedure in Section 2.2 we first put this homogeneous equation in 
the form 


dY 1 - 2 (Y/X) 
dX ~ 3{Y/X) - A 

and let Y = vX to obtain 


x dv 1 — 2v 
V + X dX = 3p-4' 


This reduces to 


(3i> — 4) dv 
3v 2 -2v- 1 


dX 

~Y' 


Integrating (we recommend the use of tables here), we obtain 

|3i? — 3 


| In \3v 2 — 2v — 1 1 — | In 


3v + 1 


= —In \X\ + In |cj, 


or 


ln(3i; 2 — 2v — l) 2 - In 


3i? — 3 


3v + 1 


= In 


(2.55) 


or 


or, finally, 


In 


(3v + l) 5 
v- 1 



X 4 |(3i>+ 1) 5 | = c|i>- 1|, 


where c = c\. These are the solutions of the separable equation (2.55). Now replacing 
v by Y/X, we obtain the solutions of the homogeneous equation (2.54) in the form 

\3Y + X| 5 =c\Y - X\. 


Finally, replacing X by x — 3 and Y by y - 2 from the original transformation, we 
obtain the solutions of the differential equation (2.53) in the form 

|3(y - 2) + (x - 3)| 5 =c|y-2-x + 3| 


or 

|x + 3y-9| 5 = c\y - x + 1|. 


► Example 2.20 


(x + 2y + 3) dx + (2x + 4 y - 1) dy = 0. 


(2.56) 



2.4 SPECIAL INTEGRATING FACTORS AND TRANSFORMATIONS 67 


Here a i = 1, b l = 2, a 2 = 2, b 2 = 4, and a 2 /a { = b 2 /b l = 2. Therefore, this is Case 2 of 
Theorem 2.7. We therefore let 

z = x + 2y, 

and Equation (2.56) transforms into 

(z + 3 )dx + (2 z - l)^ dz = 0 

or 

7 dx + (2z — 1) dz = 0, 
which is separable. Integrating, we have 

lx + z 2 — z = c. 

Replacing z by x -f 2y we obtain the solution of Equation (2.56) in the form 

lx + (x + 2 y) 2 - (x + 2y) = c 


or 


x 2 -f 4xy + 4y 2 -f 6x — 2 y — c. 


C. Other Special Types and Methods; An Important Reference 

Many other special types of first-order equations exist for which corresponding special 
methods of solution are known. We shall not go into such highly specialized types 
in this book. Instead we refer the reader to the book Differ entialgleichungen: 
Losungsmethoden und Losungen , by E. Kamke (Chelsea, New York, 1948). This re- 
markable volume contains discussions of a large number of special types of equations 
and their solutions. We strongly suggest that the reader consult this book whenever he 
encounters an unfamiliar type of equation. Of course one may encounter an equation 
for which no exact method of solution is known. In such a case one must resort to 
various methods of approximation. We shall consider some of these general methods 
in Chapter 8. 


Exercises 

Solve each differential equation in Exercises 1-4 by first finding an integrating factor. 

1. (5 xy + 4 y 2 + 1) dx + (x 2 + 2 xy) dy = 0. 

2. (2x + tan y) dx + (x — x 2 tan y) dy = 0. 

3. [y 2 (x + 1) + y] dx + (2xy + 1) dy = 0. 

4. (2xy 2 + y) dx + (2y 3 — x)dy = 0. 

In each of Exercises 5 and 6 find an integrating factor of the form x p y q and solve. 

5. (4xy 2 + 6y) dx -f (5x 2 y + 8x) dy = 0. 

6. (8x 2 y 3 - 2y 4 ) dx + (5x 3 y 2 - 8xy 3 ) dy = 0. 
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Solve each differential equation in Exercises 7- 10 by making a suitable transformation. 

7. (5x -f- 2 y + 1) dx -h (2x -h y T 1) dy — 0. 

8. (3x — y + 1) dx — (6x — 2y - 3) dy = 0. 

9. (x — 2y — 3) dx + (2x + y — 1) dy = 0. 

10. (lOx — Ay -f 12) dx — (x + 5y + 3) dy = 0. 

Solve the initial-value problems in Exercises 11-14. 

11. (6x + Ay + 1) dx -f (4x + 2y + 2) dy = 0, y(|) = 3. 

12. (3x — y - 6) dx + (x + y + 2) dy = 0, y( 2) = - 2. 

13. (2x + 3y + 1) dx 4- (4x + 6y -j- 1) dy = 0, y( — 2) = 2. 

14. (4x + 3y + 1) dx + (x + y + 1) dy = 0, y(3) = -4. 

15. Prove Theorem 2.6. 

16. Prove Theorem 2.7. 

17. Show that if /x(x, y) and v(x , y) are integrating factors of 

M(x, y) dx + N(x 9 y) dy = 0 (A) 

such that /x(x, y)/v(x 9 y) is not constant, then 

ju(x, y ) = cv(x , y) 

is a solution of Equation (A) for every constant c. 

18. Show that if the equation 

M(x, y) dx -f A(x, y) dy = 0 (A) 

is homogeneous and M(x, y)x + N(x> y)y # 0, then l/[M(x, y)x + N(x, y)y] is 
an integrating factor of (A). 

19. Show that if the equation M(x, y) dx + N(x, y) dy = 0 is both homogeneous and 
exact and if M(x, y)x + N(x, y)y is not a constant, then the solution of this 
equation is M(x, y)x + N(x, y)y = c, where c is an arbitrary constant. 

20. An equation that is of the form 

y = px + f(p), (A) 

where p = dy/dx and / is a given function, is called a Clairaut equation. Given 
such an equation, proceed as follows: 

1. Differentiate (A) with respect to x and simplify to obtain 

[x + /,(p)] S = °- (B) 

Observe that (B) is a first-order differential equation in x and p. 

2. Assume x + f'(p) # 0, divide through by this factor, and solve the resulting 
equation to obtain 

p = c, (C) 

where c is an arbitrary constant. 
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3. Eliminate p between (A) and (C) to obtain 

y = cx + /(c). (D) 

Note that (D) is a one-parameter family of solutions of (A) and compare the form 
of differential equation (A) with the form of the family of solutions (D). 

4. Remark. Assuming x + f f (p) = 0 and then eliminating p between (A) and 
x + f'(p) = 0 may lead to an “extra” solution that is not a member of the one- 
parameter family of solutions of the form (D). Such an extra solution is usually 
called a singular solution. For a specific example, see Exercise 21. 


21. Consider the Clair aut equation 


y = px + p 2 , where p 


dy_ 

dx 


(a) Find a one-parameter family of solutions of this equation. 

(b) Proceed as in the Remark of Exercise 20 and find an “extra” solution that is 
not a member of the one-parameter family found in part (a). 

(c) Graph the integral curves corresponding to several members of the one- 
parameter family of part (a); graph the integral curve corresponding to the 
“extra” solution of part (b); and describe the geometric relationship between 
the graphs of the members of the one-parameter family and the graph of the 
“extra” solution. 



CHAPTER THREE 

Applications of First-Order Equations 


In Chapter 1 we pointed out that differential equations originate from the mathemati- 
cal formulation of a great variety of problems in science and engineering. In this 
chapter we consider problems that give rise to some of the types of first-order ordinary 
differential equations studied in Chapter 2. First, we formulate the problem mathemat- 
ically, thereby obtaining a differential equation. Then we solve the equation and 
attempt to interpret the solution in terms of the quantities involved in the original 
problem. 


3.1 ORTHOGONAL AND OBLIQUE TRAJECTORIES 
A. Orthogonal Trajectories 
DEFINITION 

Let 


F(x,y,c) = 0 (3.1) 

be a given one-parameter family of curves in the xy plane. A curve that intersects the 
curves of the family (3.1) at right angles is called an orthogonal trajectory of the given 
family. 
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► Example 3.1 

Consider the family of circles 


x 2 + y 2 = c 2 


(3.2) 
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with center at the origin and radius c. Each straight line through the origin, 

y = foe, (3.3) 

is an orthogonal trajectory of the family of circles (3.2). Conversely, each circle of the 
family (3.2) is an orthogonal trajectory of the family of straight lines (3.3). The families 
(3.2) and (3.3) are orthogonal trajectories of each other. In Figure 3.1 several members 
of the family of circles (3.2), drawn solidly, and several members of the family of 
straight lines (3.3), drawn with dashes, are shown. 

The problem of finding the orthogonal trajectories of a given family of curves arises 
in many physical situations. For example, in a two-dimensional electric field the lines of 
force (flux lines) and the equipotential curves are orthogonal trajectories of each other. 

We now proceed to find the orthogonal trajectories of a family of curves 


F(x, y, c) = 0. (3.1) 

We obtain the differential equation of the family (3.1) by first differentiating Equation 
(3.1) implicitly with respect to x and then eliminating the parameter c between the 
derived equation so obtained and the given equation (3.1) itself. We assume that the 
resulting differential equation of the family (3.1) can be expressed in the form 

l f = f(x,y ). (3.4) 

dx 

Thus the curve C of the given family (3.1) which passes through the point (x, y) has 
the slope /(x, y) there. Since an orthogonal trajectory of the given family intersects 
each curve of the family at right angles, the slope of the orthogonal trajectory to C at 
(x, y) is 


1 _ 

f{x,y)' 



Figure 3.1 
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Thus the differential equation of the family of orthogonal trajectories is 

dy _ 1 

dx~ /(x,y)‘ 

A one-parameter family 

G(x, y, c) = 0 


(3.5) 


or 


y = F(x,c) 

of solutions of the differential equation (3.5) represents the family of orthogonal 
trajectories of the original family (3.1), except possibly for certain trajectories that are 
vertical lines. 

We summarize this procedure as follows: 

Procedure for Finding the Orthogonal Trajectories of a 
Given Family of Curves 

Step 1. From the equation 

F(x,y 9 c) = 0 (3.1) 

of the given family of curves, find the differential equation 

£ = /(*,}0 (3.4) 


of this family. 

Step 2. In the differential equation dy/dx = f(x , y) so found in Step 1, replace 
f{x , y) by its negative reciprocal — l//(x, y). This gives the differential equation 


dx /(x,y) 

of the orthogonal trajectories. 

Step 3. Obtain a one-parameter family 

G(x, y, c) = 0 or y = F(x, c) 

of solutions of the differential equation (3.5), thus obtaining the desired family of 
orthogonal trajectories (except possibly for certain trajectories that are vertical lines 
and must be determined separately). 

Caution. In Step 1, in finding the differential equation (3.4) of the given family, be 
sure to eliminate the parameter c during the process. 

► Example 3.2 

In Example 3.1 we stated that the set of orthogonal trajectories of the family of circles 

x 2 + y 2 = c 2 (3.2) 
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is the family of straight lines 

y = foe. 

Let us verify this using the procedure outlined above. 


Step 1. Differentiating the equation 
of the given family, we obtain 


x 2 + y 2 = c 2 


(3.3) 


(3.2) 


dy 

x+ ^ = °- 

From this we obtain the differential equation 

dy x 

dx y 


(3.6) 


of the given family (3.2). (Note that the parameter c was automatically eliminated in this 
case.) 


Step 2. We replace — x/y by its negative reciprocal y/x in the differential equation 
(3.6) to obtain the differential equation 


<b_ = y_ 

dx x 


(3.7) 


of the orthogonal trajectories. 


Step 3. We now solve the differential equation (3.7). Separating variables, we have 


integrating, we obtain 


dy _ dx 

7 = V ; 

y = kx. 


(3.3) 


This is a one-parameter family of solutions of the differential equation (3.7) and thus 
represents the family of orthogonal trajectories of the given family of circles (3.2) 
(except for the single trajectory that is the vertical line x = 0 and this may be deter- 
mined by inspection). 


► Example 3.3 

Find the orthogonal trajectories of the family of parabolas y = cx 2 . 

Step L We first find the differential equation of the given family 

y = cx 2 . (3.8) 


Differentiating, we obtain 
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Figure 3.2 


Eliminating the parameter c between Equations (3.8) and (3.9), we obtain the differ- 
ential equation of the family (3.8) in the form 


di = 2y 

dx x * 


(3.10) 


Step 2. We now find the differential equation of the orthogonal trajectories by 
replacing 2 y/x in (3.10) by its negative reciprocal, obtaining 


dy x 

dx 2y 


(3.11) 


Step 3 . We now solve the differential equation (3.11). Separating variables, we have 

2 y dy = — x dx. 

Integrating, we obtain the one-parameter family of solutions of (3.1 1) in the form 

x 2 + 2 y 2 = k 2 , 

where k is an arbitrary constant. This is the family of orthogonal trajectories of (3.8); it 
is clearly a family of ellipses with centers at the origin and major axes along the x axis. 
Some members of the original family of parabolas and some of the orthogonal 
trajectories (the ellipses) are shown in Figure 3.2. 


B. Oblique Trajectories 

DEFINITION 

Let 


F(x, y,c) = 0 


(3.12) 
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be a one-parameter family of curves. A curve that intersects the curves of the family (3.12) 
at a constant angle a # 90° is called an oblique trajectory of the given family. 


Suppose the differential equation of a family is 


dy_ 

dx 


= fix, y). 


(3.13) 


Then the curve of the family (3. 1 3) through the point (x, y) has slope / (x, y) at (x, y) and 
hence its tangent line has angle of inclination tan _1 [/(x, y)] there. The tangent line 
of an oblique trajectory that intersects this curve at the angle a will thus have angle 
of inclination 


tan 1 [/(x, y)] + a 

at the point (x, y). Hence the slope of this oblique trajectory is given by 

/(x, y) + tan a 
1 - /(x, y)tan a * 

Thus the differential equation of such a family of oblique trajectories is given by 


t&n jtan l [f(x, y)] + a } = 


dy _ / (x, y) + tan a 
dx 1 — /(x, y)tan a 

Thus to obtain a family of oblique trajectories intersecting a given family of curves at 
the constant angle a # 90°, we may follow the three steps in the above procedure (page 
72) for finding the orthogonal trajectories, except that we replace Step 2 by the fol- 
lowing step: 


Step 2'. In the differential equation dy/dx = /(x, y) of the given family, replace 
/(x, y) by the expression 


/(x, y) + tan a 
1 - f(x, y) tan a ' 


(3.14) 


► Example 3.4 


Find a family of oblique trajectories that intersect the family of straight lines y = cx at 
angle 45°. 


Step 1. From y = cx, we find dy/dx = c. Eliminating c, we obtain the differential 
equation 


di = y_ 

dx x 


(3.15) 


of the given family of straight lines. 


Step 2’. We replace f(x, y) = y/x in Equation (3.15) by 

f(x, y) + tan a _ y/x + 1 _ x + y 
1 — f(x, y)tan a 1 — y/x x — y 
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(tan a = tan 45° = 1 here). Thus the differential equation of the desired oblique 
trajectories is 


dy _ x + y 
dx x — y 


(3.16) 


Step 3. We now solve the differential equation (3.16). Observing that it is a 
homogeneous differential equation, we let y = vx to obtain 


dv 

v + x — = 
dx 


1 + i> 

1 - v ’ 


After simplifications this becomes 

(v — 1) dv 


v 2 + 1 


dx 
x * 


Integrating we obtain 

\ \n(v 2 + 1) — arctan v = — In |x| — In \ c\ 


or 


In c 2 x 2 (v 2 + 1) — 2 arctan v = 0. 

Replacing v by y/x, we obtain the family of oblique trajectories in the form 


In c 2 (x 2 + y 2 ) — 2 arctan ^ = 0. 


Exercises 

In Exercises 1-9 find the orthogonal trajectories of each given family of curves. In each 
case sketch several members of the family and several of the orthogonal trajectories on 
the same set of axes. 

1 . y = cx 3 . 

3. cx 2 + y 2 = 1. 

5. y = x — 1 + ce 

7. x 2 -h y 2 — cx 3 . 

y 2 c 

9. x = — + -j. 

4 y l 

10. Find the orthogonal trajectories of the family of ellipses having center at the 
origin, a focus at the point (c, 0), and semimajor axis of length 2c. 

1 1. Find the orthogonal trajectories of the family of circles which are tangent to the y 
axis at the origin. 

12. Find the value of K such that the parabolas y = c t x 2 + K are the orthogonal 
trajectories of the family of ellipses x 2 + 2 y 2 — y = c 2 . 


2. y 2 = cx. 

4. y = e cx . 

6. x — y = cx 2 . 

8. x 2 = 2y — 1 + ce~ 2y . 



3.2 PROBLEMS IN MECHANICS 77 


13. Find the value of n such that the curves x n + y n = are the orthogonal 
trajectories of the family 

x 

^ 1 — c 2 x* 

14. A given family of curves is said to be self-orthogonal if its family of orthogonal 
trajectories is the same as the given family. Show that the family of parabolas 
y 2 = 2cx + c 2 is self-orthogonal. 

15. Find a family of oblique trajectories that intersect the family of circles x 2 + y 2 = 
c 2 at angle 45°. 

16. Find a family of oblique trajectories that intersect the family of parabolas y 2 = cx 
at angle 60°. 

17. Find a family of oblique trajectories that intersect the family of curves x + y = 
cx 2 at angle a such that tan a = 2. 


3.2 PROBLEMS IN MECHANICS 
A. Introduction 

Before we apply our knowledge of differential equations to certain problems in 
mechanics, let us briefly recall certain principles of that subject. The momentum of a 
body is defined to be the product mv of its mass m and its velocity v. The velocity v and 
hence the momentum are vector quantities. We now state the following basic law of 
mechanics: 

Newton's Second Law. The time rate of change of momentum of a body is 
proportional to the resultant force acting on the body and is in the direction of this 
resultant force. 

In mathematical language, this law states that 

j t (mv) = KF, 

where m is the mass of the body, v is its velocity, F is the resultant force acting upon it, 
and K is a constant of proportionality. If the mass m is considered constant, this 
reduces to 


or 


m ^ = KF, 
dt 


a = K 


m 


(3.17) 


or 


F = kma. 


(3.18) 
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where k = 1/K and a = dv/dt is the acceleration of the body. The form (3.17) is a direct 
mathematical statement of the manner in which Newton’s second law is usually 
expressed in words, the mass being considered constant. However, we shall make use of 
the equivalent form (3.18). The magnitude of the constant of proportionality k depends 
upon the units employed for force, mass, and acceleration. Obviously the simplest 
systems of units are those for which k = 1 . When such a system is used (3. 1 8) reduces to 

F = ma. (3.19) 

It is in this form that we shall use Newton’s second law. Observe that Equation (3.19) is 
a vector equation. 

Several systems of units for which k = 1 are in use. In this text we shall use only three: 
the British gravitational system (British), the centimeter-gram-second system (cgs), and 
the meter-kilogram-second system (mks). We summarize the various units of these 
three systems in Table 3.1. 

Recall that the force of gravitational attraction that the earth exerts on a body is 
called the weight of the body. The weight, being a force, is expressed in force units. Thus 
in the British system the weight is measured in pounds; in the cgs system, in dynes; and 
in the mks system, in newtons. 

Let us now apply Newtons’s second law to a freely falling body (a body falling 
toward the earth in the absence of air resistance). Let the mass of the body be m and let 
w denote its weight. The only force acting on the body is its weight and so this is the 
resultant force. The acceleration is that due to gravity, denoted by g , which is 
approximately 32 ft/sec 2 in the British system, 980 cm/sec 2 in the cgs system, and 
9.8 m/sec 2 in the mks system (for points near the earth’s surface). Newton’s second law 
F = ma thus reduces to w = mg. Thus 



(3.20) 


a relation that we shall frequently employ. 

Let us now consider a body B in rectilinear motion, that is, in motion along a straight 
line L. On L we choose a fixed reference point as origin O , a fixed direction as posi- 
tive, and a unit of distance. Then the coordinate x of the position of B from the origin 0 
tells us the distance or displacement of B. (See Figure 3.3.) The instantaneous velocity of 
B is the time rate of change of x: 



TABLE 3.1 



British System 

cgs System 

mks System 

force 

pound 

dyne 

newton 

mass 

slug 

gram 

kilogram 

distance 

fool 

centimeter 

meter 

time 

second 

second 

second 

acceleration 

ft/sec 2 

cm/sec 2 

m/sec 2 
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Figure 3.3 


and the instantaneous acceleration of B is the time rate of change of v: 

dv d 2 x 
dt dt 2 * 

Note that x,v, and a are vector quantities. All forces, displacements, velocities, and 
accelerations in the positive direction on L are positive quantities; while those in the 
negative direction are negative quantities. 

If we now apply Newton’s second law F — ma to the motion of B along L, noting 
that 

dv dv dx dv 

dt dx dt dx ’ 

we may express the law in any of the following three forms: 


dv 



(3.21) 

d 2 x 

(3.22) 

3 

<-* 

K)| 

II 

dv 

(3.23) 

mv— = F , 
dx 


where F is the resultant force acting on the body. The form to use depends upon the way 
in which F is expressed. For example, if F is a function of time t only and we desire to 
obtain the velocity v as a function of t , we would use (3.21); whereas if F is expressed as a 
function of the displacement x and we wish to find v as a function of x, we would 
employ (3.23). 

B. Falling Body Problems 

We shall now consider some examples of a body falling through air toward the earth. In 
such a circumstance the body encounters air resistance as it falls. The amount of air 
resistance depends upon the velocity of the body, but no general law exactly expressing 
this dependence is known. In some instances the law R = kv appears to be quite 
satisfactory, while in others R = kv 2 appears to be more exact. In any case, the con- 
stant of proportionality k in turn depends on several circumstances. In the ex- 
amples that follow we shall assume certain reasonable resistance laws in each case. 
Thus we shall actually be dealing with idealized problems in which the true resis- 
tance law is approximated and in which certain comparatively negligible factors are 
disregarded. 
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► Example 3.5 

A body weighing 8 lb falls from rest toward the earth from a great height. As it falls, air 
resistance act upon it, and we shall assume that this resistence (in pounds) is 
numerically equal to 2v, where v is the velocity (in feet per second). Find the velocity and 
distance fallen at time t seconds. 


Formulation. We choose the positive x axis vertically downward along the path of 
the body B and the origin at the point from which the body fell. The forces acting on the 
body are: 

1. F 1? its weight, 8 lb, which acts downward and hence is positive. 

2. F 2 , the air resistance, numerically equal to 2v, which acts upward and hence is the 
negative quantity — 2v. 

See Figure 3.4, where these forces are indicated. 


Newton’s second law, F = ma, becomes 


dv 

m n = F ' + F ‘ 

or, taking g = 32 and using m = w/g = jz = 4 , 

1 dv 

- — = 8 - 2t>. 

4 dt 

Since the body was initially at rest, we have the initial condition 

v(0) = 0. 


(3.24) 


(3.25) 


0 


x 


B 


F 2 = -2v 

F x =8 


+ 


Earth 

Figure 3.4 
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Solution. Equation (3.24) is separable. Separating variables, we have 


dv 


8 2t; 


= 4 dt. 


Integrating we find 

— j In 1 8 — 2v\ = 4f + c 0 , 

which reduces to 


8 — 2v = c x e~ % \ 

Applying the condition (3.25) we find c x = 8. Thus the velocity at time t is given by 

v = 4(1 -e~ st ). (3.26) 

Now to determine the distance fallen at time t, we write (3.26) in the form 

1 = 4,1 -.'I 

and note that x(0) = 0. Integrating the above equation, we obtain 

x = 4 (t + fe“ 8 ') + c 2 . 

Since x = 0 when t = 0, we find c 2 = — i and hence the distance fallen is given by 

x = 4 (t + ie“ 8f -i). (3.27) 

Interpretation of Results. Equation (3.26) shows us that as t oo, the velocity v 
approaches the limiting velocity 4(ft/sec). We also observe that this limiting velocity is 
approximately attained in a very short time. Equation (3.27) states that as t -> oo, x also 
-► oo. Does this imply that the body will plow through the earth and continue forever? 
Of course not; for when the body reaches the earth’s surface its motion will certainly 
cease. How then do we reconcile this obvious end to the motion with the statement of 
Equation (3.27)? It is simple; when the body reaches the earth’s surface, the differential 
equation (3.24) and hence Equation (3.27) no longer apply! 

► Example 3.6 

A skydiver equipped with parachute and other essential equipment falls from rest 
toward the earth. The total weight of the man plus the equipment is 160 lb. Before the 
parachute opens, the air resistance (in pounds) is numerically equal to ft;, where v is the 
velocity (in feet per second). The parachute opens 5 sec after the fall begins; after it 
opens, the air resistance (in pounds) is numerically equal to ft; 2 , where v is the velocity 
(in feet per second). Find the velocity of the skydiver (A) before the parachute opens, 
and (B) after the parachute opens. 

Formulation. We again choose the positive x axis vertically downward with the 
origin at the point where the fall began. The statement of the problem suggests that we 
break it into two parts: (A) before the parachute opens; (B) after it opens. 

We first consider problem (A). Before the parachute opens, the forces acting upon the 
skydiver are: 

1. F x , the weight, 160 lb, which acts downward and hence is positive. 
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2. F 2 , the air resistance, numerically equal to which acts upward and hence is the 
negative quantity —\v. 

We use Newton’s second law F = ma , where F = F x + F 2 , let m = w/g, and take 
g = 32. We obtain 

5^ = 160 -\v. 
dt 

Since the skydiver was initially at rest, v = 0 when t = 0. Thus, problem (A), concerned 
with the time before the parachute opens, is formulated as follows: 


5 ^ = 160 — 
dt 


(3.28) 


v(0) = 0. (3.29) 

We now turn to the formulation of problem (B). Reasoning as before, we see that 
after the parachute opens, the forces acting upon the skydiver are: 

1. F l = 160, exactly as before. 

2. F 2 = — f y 2 (instead of — %v). 


Thus, proceeding as above, we obtain the differential equation 

5^= 160-|i; 2 . 
dt 

Since the parachute opens 5 sec after the fall begins, we have v = v l when t = 5, where 
v x is the velocity attained when the parachute opened. Thus, problem (B), concerned 
with the time after the parachute opens, is formulated as follows: 


5^= 160 — |y 2 , 
f(5) = v v 


(3.30) 

(3.31) 


Solution. We shall first consider problem (A). We find a one-parameter family of 
solution of 


5 


dv 

dt 


160 - it?. 


(3.28) 


Separating variables, we obtain 

dv , , 

tT^320 _ " 10 dt ' 

Integration yields 

ln(u - 320) = - jot + Cq, 
which readily simplifies to the form 

v = 320 + ce~ t,l °. 



3.2 PROBLEMS IN MECHANICS 83 


Applying the initial condition (3.29) that v = 0 at t = 0, we find that c = — 320. Hence 
the solution to problem (A) is 

v = 320(1 — e~ t/10 ) 9 (3.32) 

which is valid for 0 < t < 5. In particular, where t = 5, we obtain 

v x = 320(1 -e~ l/2 )x 126, (3.33) 

which is the velocity when the parachute opens. 

Now let us consider problem (B). We first find a one-parameter family of solutions of 
the differential equation 

5^ =160 -f» 2 (3.30) 

at 

Simplifying and separating variables, we obtain 

dv dt 

v 2 - 256 = T’ 

Integration yields 

1 , v- 16 t 

M n „ , U" _ 0 + C 2 


or 


In 


i; — 16 

o + 16 


-4r + c t . 


This readily simplifies to the form 


t? — 16 

v + 16 


= ce 


-At 


and solving this for v we obtain 


16(ce~ 4t + 1) 
l-ce~ 4t ' 


(3.34) 


(3.35) 


Applying the initial condition (3.31) that v = v t at t = 5, where is given by (3.33) and 
is approximately 126, to (3.34), we obtain 


Substituting this into (3.35) we obtain 


v = 


16(H§e 2 °-*'+ 1) 

1 110^20-4r ’ 

1 “ 142 ^ 


(3.36) 


which is valid for t > 5. 


Interpretation of Results. Let us first consider the solution of problem (A), given 
by Equation (3.32). According to this, as t -* oo, v approaches the limiting velocity 
320 ft/sec. Thus if the parachute never opened, the velocity would have been approx- 
imately 320 ft/sec at the time when the unfortunate skydiver would have struck the 
earth! But, according to the statement of the problem, the parachute does open 5 sec 
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after the fall begins (we tacitly and thoughfully assume 5 « T, where T is the time 
wh/yj* tot 'yyito ra/tottoy Tb/tto vtbtvibi/g to tot ^ 1 2 vqiati&h 

(3.36), we see that as t -* oo, v approaches the limiting velocity 16 ft/sec. Thus, assuming 
that the parachute opens at a considerable distance above the earth, the velocity is 
approximately 16 ft /sec when the earth is finally reached. We thus obtain the well- 
known fact that the velocity of impact with the open parachute is a small fraction of 
the impact velocity that would have occured if the parachute had not opened. The 
calculations in this problem are somewhat complicated, but the moral is clear: Make 
certain that the parachute opens! 


C. Frictional Forces 

If a body moves on a rough surface, it will encounter not only air resistance but also 
another resistance force due to the roughness of the surface. This additional force is 
called friction. It is shown in physics that the friction is given by /*N, where 

1. n is a constant of proportionality called the coefficient of friction , which depends 
upon the roughness of the given surface; and 

2. N is the normal (that is, perpendicular) force which the surface exerts on the body. 

We now apply Newton’s second law to a problem in which friction is involved. 

► Example 3.7 

An object weighing 48 lb is released from rest at the top of a plane metal slide that is 
inclined 30° to the horizontal. Air resistance (in pounds) is numerically equal to one- 
half the velocity (in feet per second), and the coefficient of friction is one-quarter. 

A. What is the velocity of the object 2 sec after it is released? 

B. If the slide is 24 ft long, what is the velocity when the object reaches the bottom? 

Formulation. The line of motion is along the slide. We choose the origin at the top 
and the positive x direction down the slide. If we temporarily neglect the friction and air 
resistance, the forces acting upon the object A are: 

1. Its weight, 48 lb, which acts vertically downward; and 

2. The normal force, N , exerted by the slide which acts in an upward direction 
perpendicular to the slide. (See Figure 3.5.) 

The components of the weight parallel and perpendicular to the slide have magnitude 

48 sin 30° = 24 


and 

48 cos 30° = 24^/3, 

respectively, The components perpendicular to the slide are in equilibrium and hence 
the normal force N has magnitude 24^/3. 
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Now, taking into consideration^ friction and air resistance, we see that the forces 
acting on the object as it moves along the slide are the following: 

1. F 1 , the component on the weight parallel to the plane, having numerical value 24. 
Since this force acts in the positive (downward) direction along the slide, we have 

F i = 24. 

2. F 2 , the frictional force, having numerical value g N = *(24^/3). Since this acts in the 
negative (upward) direction along the side, we have 

F 2 = - 6^3. 

3. F 3 , the air resistance, having numerical value \v. Since v > 0 and this also acts in 
the negative direction, we have 

Fs = -iv. 

We apply Newton’s second law F = ma. Here F = F t + F 2 + F 3 = 24 — 6^/3 — \v 
and m = w/g = f| = |. Thus we have the differential equation 

~ = 24-6V3-i». (3.37) 

Since the object is released from rest, the initial condition is 

0(0) = 0. (3.38) 

Solution. Equation (3.37) is separable; separating variables we have 

dv dt 

48 - 12^/3 - v ~ 3 ' 

Integrating and simplifying, we find 

v = 48 — 12^/3 — c t e~ t/3 . 

The condition (3.38) gives c t = 48 — 12^/3. Thus we obtain 

y = (48 — 12 x /3)(l -e~ t/3 ). (3.39) 

Question A is thus answered by letting t — 2 in Equation (3.39). We find 
v(2) = (48 - 12y3)( 1 - e~ 213 ) * 10.2(ft/sec). 
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In order to answer question B, we integrate (3.39) to obtain 
x = (48 — 12^/3 )(t + 3e~ t/3 ) + c 2 . 

Since x(0) = 0, c 2 = —(48 — 12 v / / 3)(3). Thus the distance covered at time t is given by 
x = (48 - 12^3 )(t + 3e~ t/3 - 3). 

Since the slide is 24 ft long, the object reaches the bottom at the time T determined from 
the transcendental equation 

24 = (48 - 12^/3)(T + 3e ~ T/ 3 - 3), 

which may be written as 

r/3 _ 47 + 2^/3 _ 

13 

The value of T that satisfies this equation is approximately 2.6. Thus from Equation 
(3.39) the velocity of the object when it reaches the bottom is given approximately by 

(48 - 12 v /3)(l - e~ 0 - 9 ) » 12.3 (ft/sec). 


Exercises 

1. A stone weighing 4 lb falls from rest toward the earth from a great height. As it 
falls it is acted upon by air resistance that is numercially equal to (in pounds), 
where v is the velocity (in feet per second). 

(a) Find the velocity and distance fallen at time t sec. 

(b) Find the velocity and distance fallen at the end of 5 sec. 

2. A ball weighing 6 lb is thrown vertically downward toward the earth from a height 
of 1000 ft with an initial velocity of 6 ft/sec. As it falls it is acted upon by air 
resistance that is numerically equal to ft; (in pounds), wheret; is the velocity (in feet 
per second). 

(a) What is the velocity and distance fallen at the end of one minute? 

(b) With what velocity does the ball strike the earth? 

3. A ball weighing f lb is thrown vertically upward from a point 6 ft above the 
surface of the earth with an initial velocity of 20 ft/sec. As it rises it is acted upon 
by air resistance that is numerically equal to ^ v (in pounds), where v is the 
velocity (in feet per second). How high will the ball rise? 

4. A ship which weighs 32,000 tons starts from rest under the force of a constant 
propeller thrust of 100,000 lb. The resistance in pounds is numerically equal to 
8000i;, where v is in feet per second. 

(a) Find the velocity of the ship as a function of the time. 

(b) Find the limiting velocity (that is, the limit of v as t -► + 00 ). 

(c) Find how long it takes the ship to attain a velocity of 80% of the limiting 
velocity. 

5. A body of mass 100 g is dropped from rest toward the earth from a height of 
1000 m. As it falls, air resistance acts upon it, and this resistance (in newtons) is 
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proportional to the velocity v (in meters per second). Suppose the limiting velocity 
is 245 m/sec. 

(a) Find the velocity and distance fallen at time t secs. 

(b) Find the time at which the velocity is one-fifth of the limiting velocity. 

6. An object of mass 100 g is thrown vertically upward from a point 60 cm above the 
earth’s surface with an initial velocity of 150 cm/sec. It rises briefly and then falls 
vertically to the earth, all of which time it is acted on by air resistance that is 
numerically equal to 200i> (in dynes), where v is the velocity (in cm/sec). 

(a) Find the velocity 0.1 sec after the object is thrown. 

(b) Find the velocity 0.1 sec after the object stops rising and starts falling. 

7. Two people are riding in a motorboat and the combined weight of individuals, 
motor, boat, and equipment is 640 lb. The motor exerts a constant force of 20 lb 
on the boat in the direction of motion, while the resistance (in pounds) is 
numerically equal to one and one-half times the velocity (in feet per second). If the 
boat started from rest, find the velocity of the boat after (a) 20 sec, (b) 1 min. 

8. A boat weighing 150 lb with a single rider weighing 170 lb is being towed in a 
certain direction at the rate of 20 mph. At time t = 0 the tow rope is suddenly cast 
off and the rider begins to row in the same direction, exerting a force equivalent to 
a constant force of 1 2 lb in this direction. The resistance (in pounds) is numerically 
equal to twice the velocity (in feet per second). 

(a) Find the velocity of the boat 15 sec after the tow rope was cast off. 

(b) How many seconds after the tow rope is cast off will the velocity be one-half 
that at which the boat was being towed? 

9. A bullet weighing 1 oz is fired vertically downward from a stationary helicopter 
with a muzzle velocity of 1200 ft/sec. The air resistance (in pounds) is numerically 
equal to 1 6 “ 5 v 2 , where v is the velocity (in feet per second). Find the velocity of the 
bullet as a function of the time. 

10. A shell weighing 1 lb is fired vertically upward from the earth’s surface with a 
muzzle velocity of 1000 ft/sec. The air resistance (in pounds) is numerically equal 
to 10 -4 t; 2 , where v is the velocity (in feet per second). 

(a) Find the velocity of the rising shell as a function of the time. 

(b) How long will the shell rise? 

11. An object weighing 16 lb is dropped from rest on the surface of a calm lake and 
thereafter starts to sink. While its weight tends to force it downward, the 
buoyancy of the object tends to force it back upward. If this buoyancy force is one 
of 6 lb and the resistance of the water (in pounds) is numerically equal to twice the 
square of the velocity (in feet per second), find the formula for the velocity of the 
sinking object as a function of the time. 

12. An object weighing 12 lb is placed beneath the surface of a calm lake. The 
buoyancy of the object is 30 lb; because of this the object begins to rise. If 
the resistance of the water (in pounds) is numerically equal to the square of the 
velocity (in feet per second) and the object surfaces in 5 sec, find the velocity of the 
object at the instant when it reaches the surface. 
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13. A man is pushing a loaded sled across a level field of ice at the constant speed of 
10 ft/sec. When the man is halfway across the ice field, he stops pushing and lets 
the loaded sled continue on. The combined weight of the sled and its load is 80 lb; 
the air resistance (in pounds) is numerically equal to ft?, where v is the velocity of 
the sled (in feet per second); and the coefficient of friction of the runners on the ice 
is 0.04. How far will the sled continue to move after the man stops pushing? 

14. A girl onjier sled has just slid down a hill onto a level field of ice and is starting to 
slow down. At the instant when their speed is 5 ft/sec, the girl’s father runs up and 
begins to push the sled forward, exerting a constant force of 15 lb in the direction 
of motion. The combined weight of the girl and the sled is 96 lb, the air resistance 
(in pounds) is numerically equal to one-half the velocity (in feet per second), and 
the coefficient of friction of the runners on the ice is 0.05. How fast is the sled 
moving 10 sec after the father begins pushing? 

15. A case of canned milk weighing 24 lb is released from rest at the top of a plane 
metal slide which is 30 ft long and inclined 45° to the horizontal. Air resistance (in 
pounds) is numerically equal to one-third the velocity (in feet per second) and the 
coefficent of friction is 0.4. 

(a) What is the velocity of the moving case 1 sec after it is released? 

(b) What is the velocity when the case reaches the bottom of the slide? 

16. A boy goes sledding down a long 30° slope. The combined weight of the boy and 
his sled is 72 lb and the air resistance (in pounds) is numerically equal to twice their 
velocity (in feet per second). If they started from rest and their velocity at the end 
of 5 sec is 10 ft/sec, what is the coefficient of friction of the sled runners on the 
snow? 

17. An object weighing 32 lb is released from rest 50 ft above the surface of a calm 
lake. Before the object reaches the surface of the lake, the air resistance (in pounds) 
is given by 2v, where v is the velocity (in feet per second). After the object passes 
beneath the surface, the water resistance (in pounds) is given by 6v. Further, the 
object is then buoyed up by a buoyancy force of 8 lb. Find the velocity of the 
object 2 sec after it passes beneath the surface of the lake. 

18. A rocket of mass m is fired vertically upward from the surface of the earth with 
initial velocity v = v 0 . The only force on the rocket that we consider is the 
gravitational attraction of the earth. Then, according to Newton’s law of 
gravitation, the acceleration a of the rocket is given by a—— k/x 2 , where k > 0 
is a constant of proportionality and x is the distance “upward” from the center 
of the earth along the line of motion. At time t = 0, x = R (where R is the radius 
of the earth), a——g (where g is the acceleration due to gravity), and v = v 0 . 
Express a = dv/dt as in Equation (3.23), apply the appropriate initial data, and 
note that v satisfies the differential equation 

dv gR 2 

V dx x 2 

Solve this differential equation, apply the appropriate initial condition, and thus 
express v as a function of x. In particular, show t hat t he minimum value of v 0 for 
which the rocket will escape from the earth is^JlgR. This is the so-called veloc- 
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ity of escape ; and using R = 4000 miles, g = 32 ft/sec 2 , one finds that this is 
approximately 25,000 mph (or 7 mi/sec). 

19. A body of mass m is in rectilinear motion along a horizontal axis. The resultant 
force acting on the body is given by — kx, where k > 0 is a constant of 
proportionality and x is the distance along the axis from a fixed point O. The body 
has initial velocity v = v 0 when x = x 0 . Apply Newton’s second law in the form 
(3.23) and thus write the differential equation of motion in the form 

dv . 
mv — = —kx. 
dx 

Solve the differential equation, apply the initial condition, and thus express the 
square of the velocity v as a function of the distance x. Recalling that v = dx/dt , 
show that the relation between v and x thus obtained is satisfied for all time t by 



where </> is a constant. 


3.3 RATE PROBLEMS 

In certain problems the rate at which a quantity changes is a known function of the 
amount present and/or the time, and it is desired to find the quantity itself. If x denotes 
the amount of the quantity present at time t, then dx/dt denotes the rate at which the 
quantity changes and we are at once led to a differential equation. In this section we 
consider certain problems of this type. 


A. Rate of Growth and Decay 
► Example 3.8 

The rate at which radioactive nuclei decay is proportional to the number of such nuclei 
that are present in a given sample. Half of the original number of radioactive nuclei 
have undergone disintegration in a period of 1500 years. 

1. What percentage of the original radioactive nuclei will remain after 4500 years? 

2. In how many years will only one-tenth of the original number remain? 

Mathematical Formulation. Let x be the amount of radioactive nuclei present 
after t years. Then dx/dt represents the rate at which the nuclei decay. Since the nuclei 
decay at a rate proportional to the amount present, we have 

Y t = Kx, (3.40) 

where K is a constant of proportionality. The amount x is clearly positive; further, since 
x is decreasing, dx/dt < 0. Thus, from Equation (3.40), we must have K < 0. In order to 
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emphasize that x is decreasing, we prefer to replace K by a positive constant preceded 
by a minus sign. Thus we let k = — K > 0 and write the differential equation (3.40) in 
the form 


dx 

li 


= — fcx. 


(3.41) 


Letting x 0 denote the amount initially present, we also have the initial condition 

x(0) = x o . (3.42) 

We know that we shall need such a condition in order to determine the arbitrary 
constant that will appear in a one-parameter family of solutions of the differential 
equation (3.41). However, we shall apparently need something else, for Equation (3.41) 
contains an unknown constant of proportionality k. This “something else” appears in 
the statement of the problem, for we are told that half of the original number 
disintegrate in 1500 years. Thus half also remain at that time, and this at once gives the 
condition 

x(1500) = ix o . (3.43) 

Solution. The differential equation (3.41) is clearly separable; separating variables, 
integrating, and simplifying, we have at once 

x = ce~ kt . 


Applying the initial condition (3.42), x = x 0 when t = 0, we find that c = x 0 and hence 
we obtain 

x = x 0 e“*'. (3.44) 

We have not yet determined k. Thus we now apply condition (3.43), x = ^x 0 when 
t = 1500, to Equation (3.44). We find 

_1_ -y* __ -y* S>~ 1500fc 

2 A 0 — A 0 C , 


or 


or finally 


(e-*)1500 = i 


(i) 1/150 ° 


(3.45) 


From this equation we could determine k explicitly and substitute the result into 
Equation (3.44). However, we see from Equation (3.44) that we actually do not need k 
itself but rather only e ~ k , which we have just obtained in Equation (3.45). Thus we 
substitute e~ k from (3.45) into (3.44) to obtain 

x = x 0 (e-‘)' = x 0 [(i) 1 /‘ 500 ]' 


or 

x = x 0 (^ /150 °. (3.46) 

Equation (3.46) gives the number x of radioactive nuclei that are present at time f. 
Question 1 asks us what percentage of the original number will remain after 4500 years. 
We thus let t = 4500 in Equation (3.46) and find 

X = *o(i ) 3 = 8*0- 
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Thus, one-eighth or 12.5% of the original number remain after 4500 years. Question 2 
asks us when only one-tenth will remain. Thus we let x = ^x 0 in Equation (3.46) and 
solve for t. We have 

_1 AW/ 1500 

io - (2) 

Using logarithms, we then obtain 


ln(A) = ln(ir oo =^ln(i). 

From this it follows at once that 

[ = Injo 

1500 In j 


or 


t = 


1500 In 10 
In 2 


% 4985 (years). 


B. Population Growth 

We next consider the growth of a population (for example, human, an animal species, 
or a bacteria colony) as a function of time. Note that a population actually increases 
discontinuously by whole number amounts. However, if the population is very large, 
such individual increases in it are essentially negligible compared to the entire pop- 
ulation itself. In other words, the population increase is approximately continuous. 
We shall therefore assume that this increase is indeed continuous and in fact that the 
population is a continuous and differentiable fucntion of time. 

Given a population, we let x be the number of individuals in it at time t. If we assume 
that the rate of change of the population is proportional to the number of individuals 
in it at any time, we are led to the differential equation 

^ = kx, (3.47) 

where k is a constant of proportionality. The population x is positive and is increasing 
and hence dx/dx > 0. Therefore, from (3.47), we must have k > 0. Now suppose that at 
time t 0 the population is x 0 . Then, in addition to the differential equation (3.47), we 
have the initial condition 


x(t 0 ) = x 0 . (3.48) 

The differential equation (3.47) is separable. Separating variables, integrating, and 
simplifying, we obtain 

x = ce kt . 

Applying the initial condition (3.48), x = x 0 at t = t 0 , to this, we have x 0 = ce kt0 . From 
this we at once find c = x 0 e~ kto and hence obtain the unique solution 

x = x 0 e fc( '~' o) 

of the differential equation (3.47), which satisfies the initial condition (3.48). 


(3.49) 
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From (3.49) we see that a population governed by the differential equation (3.47) with 
k > 0 and initial condition (3.48) is one that increases exponentially with time. This law 
of population growth is called the Malthusian law. We should now inquire whether or 
not there are cases in which such a model for population growth is indeed realistic. In 
answer to this, it can be shown that this model, with a suitable value of k , is remarkably 
accurate in the case of the human population of the earth during the last several 
decades (see Problem 8(b)). It is also known to be outstandingly accurate for certain 
mammalian species, with suitable k, under certain realizable conditions and for certain 
time periods. On the other hand, turning back to the case of the human population of 
the earth, it can be shown that the Malthusian law turns out to be quite unreasonable 
when applied to the distant future (see Problem 8(e)). It is also completely unrealistic for 
other populations (for example, bacteria colonies) when applied over sufficiently long 
periods of time. The reason for this is not hard to see. For, according to (3.49), a 
population modeled by this law always increases and indeed does so at an ever 
increasing rate; whereas observation shows that a given population simply does not 
grow indefinitely. 

Population growth is represented more realistically in many cases by assuming that 
the number of individuals x in the population at time t is described by a differential 
equation of the form 

^ = kx — Xx 2 , (3.50) 

at 

where k > 0 and X > 0 are constants. The additional term — Xx 2 is the result of some 
cause that tends to limit the ultimate growth of the population. For example, such a 
cause could be insufficient living space or food supply, when the population becomes 
sufficiently large. Concerning the choice of - Xx 2 for the term representing the effect of 
the cause, one can argue as follows: Assuming the cause affects the entire population of 
x members, then the effect on any one individual is proportional to x. Thus the effect on 
all x individuals in the population would be proportional to x • x = x 2 . 

We thus assume that a population is described by a differential equation of the form 
(3.50), with constants k > 0 and X > 0, and an initial condition of the form (3.48). In 
most such cases, it turns out that the constant X is very small compared to the constant 
k. Thus for sufficiently small x, the term kx predominates, and so the population grows 
very rapidly for a time. However, when x becomes sufficiently large, the term — Xx 2 is of 
comparatively greater influence, and the result of this is a decrease in the rapid growth 
rate. We note that the differential equation (3.50) is both a separable equation and a 
Bernoulli equation. The law of population growth so described is called the logistic law 
of growth. We now consider a specific example of this type of growth. 


► Example 3.9 

The population x of a certain city satisfies the logistic law 

dx 1 1 2 

a" = Too x_ (io) 8 x 


(3.51) 


where time t is measured in years. Given that the population of this city is 100,000 in 
1980, determine the population as a function of time for t > 1980. In particular, answer 
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the following questions: 

(a) What will be the population in 2000? 

(b) In what year does the 1980 population double? 

(c) Assuming the differential equation (3.51) applies for all t > 1980, how large will 
the population ultimately be? 

Solution. We must solve the separable differential equation (3.51) subject to the 
initial solution 

x(1980) = 100,000. (3.52) 

Separating variables in (3.51), we obtain 


(10)~ 2 x — (10) -8 x 2 

and hence 


dx 


(10)“ 2 x[l — (10) _6 x] 
Using partial fractions, this becomes 

( 10)- 6 


= dt. 


100 - 


"1 

_x + r 


dx = dt. 


(10)~ 6 x 

Integrating, assuming 0 < x < 10 6 , we obtain 

100{ln x — ln[l — (10) _6 x]} = t + c t 

and hence 

i 

X 


In 


_1 - 


(10) -6 x 


- 100 ^ + C 2- 


Thus we find 


1 - (10) _6 x 

Solving this for x, we finally obtain 


= ce t/10 ° . 


ce 


t / 100 


X = 


1 +(10r 6 a? f/100 ’ 

Now applying the initial condition (3.52) to this, we have 


(10) 5 = 


ce 


19.8 


from which we obtain 


c = 


1 +(10)- 6 o? 19 - 8 ’ 

(10) 5 (10) 6 


e 19 *l\ -(10) 5 (10)" 6 ] 9^ 19 * 8 


(3.53) 


Substituting this value for c back into (3.53) and simplifying, we obtain the solution in 
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the form 


( 10) 6 

1 9^19.8-r/lOO * 


(3.54) 


This gives the population x as a function of time for t > 1980. 

We now consider the questions (a), (b), and (c) of the problem. Question (a) asks for 
the population in the year 2000. Thus we let t = 2000 in (3.54) and obtain 


x = 


( 10) 6 


1 + 9e~ 


119,495. 


Question (b) asks for the year in which the population doubles. Thus we let 
x = 200,000 = 2(10) 5 in (3.54) and solve for t . We have 

2(10 ,». m 

K ’ \ _|_ 9^19.8 -f/100 ’ 

from which 


19.8-f/100 _ 4 
— 9 ? 


and hence 


r ^ 2061. 

Question (c) asks how large the population will ultimately be, assuming the differential 
equation (3.51) applies for all t > 1980. To answer this, we evaluate limx as t oo using 
the solution (3.54) of (3.51). We find 

( 10) 6 

lim x = lim - 19 , 8 „, /100 = (10) 6 = 1,000,000. 

t-*ao t-> oo It • y€ 


C. Mixture Problems 

We now consider rate problems involving mixtures. A substance S is allowed to flow 
into a certain mixture in a container at a certain rate, and the mixture is kept uniform by 
stirring. Further, in one such situation, this uniform mixture simultaneously flows out 
of the container at another (generally different) rate; in another situation this may not 
be the case. In either case we seek to determine the quantity of the substance S present 
in the mixture at time t. 

Letting x denote the amount of S present at time t , the derivative dx/dt denotes the 
rate of change of x with respect to t. If IN denotes the rate at which S enters the mixture 
and OUT the rate at which it leaves, we have at once the basic equation 

dx 

— = IN - OUT (3.55) 

dt 

from which to determine the amount x of S at time t. We now consider examples. 


► Example 3.1 0 

A tank initially contains 50 gal of pure water. Starting at time t = 0 a brine containing 
2 lb of dissolved salt per gallon flows into the tank at the rate of 3 gal/min. The mixture 
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is kept uniform by stirring and the well-stirred mixture simultaneously flows out of the 
tank at the same rate. 


1. How much salt is in the tank at any time t > 0? 

2. How much salt is present at the end of 25 min? 

3. How much salt is present after a long time? 


Mathematical Formulation. Let x denote the amount of salt in the tank at time t . 
We apply the basic equation (3.55), 


^ = IN — OUT. 
dt 


The brine flows in at the rate of 3 gal/min, and each gallon contains 2 lb of salt. Thus 


IN = (2 lb/gal)(3 gal/min) — 6 lb/min. 

Since the rate of outflow equals the rate of inflow, the tank contains 50 gal of the 
mixture at any time t . This 50 gal contains x lb of salt at time r, and so the concentration 
of salt at time t is ^x lb/gal. Thus, since the mixture flows out at the rate of 3 gal/min, we 
have 

°UT = ^ lb/gal^(3 gal/min) = ^ lb/min. 

Thus the differential equation for x as a function of t is 


dx 3x 
di = ”50* 


(3.56) 


Since initially there was no salt in the tank, we also have the initial condition 

x(0) = 0. (3.57) 


Solution. Equation (3.56) is both linear and separable. Separating variables, we 
have 


dx 3 
100 — x 50 

Integrating and simplifying, we obtain 

x = 100 + ce~ 31150 . 

Applying the condition (3.57), x = 0 at t = 0, we find that c = — 100. Thus we have 

x = 100(1 -e~ 3t/50 ). (3.58) 

This is the answer to question 1. As for question 2, at the end of 25 min, t = 25, and 
Equation (3.58) gives 

x(25) = 100(1 — e~ 1 ' 5 ) » 78(lb). 

Question 3 essentially asks us how much salt is present as t -► oo. To answer this we let 
t -+ oo in Equation (3.58) and observe that x 100. 
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► Example 3.1 1 

A large tank initially contains 50 gal of brine in which there is dissolved 10 lb of salt. 
Brine containing 2 lb of dissolved salt per gallon flows into the tank at the rate of 
5 gal/min. The mixture is kept uniform by stirring, and the stirred mixture simulta- 
neously flows out at the slower rate of 3 gal/min. How much salt is in the tank at any 
time t > 0? 

Mathematical Formulation. 

use Equation (3.55): 

Proceeding as in Example 3.10, 

IN = (2 lb/gal)(5 gal/min) = 10 lb/min; 


Let x = the amount of salt at time t . Again we shall 


dx 

~dt 


= IN - OUT. 


also, once again 


OUT = (C lb/gal)(3 gal/min), 

where C lb/gal denotes the concentration. But here, since the rate of outflow is different 
from that of inflow, the concentration is not quite so simple. At time t = 0, the tank 
contains 50 gal of brine. Since brine flows in at the rate of 5 gal/min but flows out at the 
slower rate of 3 gal/min, there is a net gain of 5 - 3 = 2 gal/min of brine in the tank. 
Thus at the end of t minutes the amount of brine in the tank is 

50 + 2 1 gal. 

Hence the concentration at time t minutes is 


x 

50 -h 2r 


lb/gal, 


and so 


OUT = 


3x 

50 + 2 t 


lb/min. 


Thus the differential equation becomes 


dx 3x 

~dt = ” 50 + 2 1 


(3.59) 


Since there was initially 10 lb of salt in the tank, we have the initial condition 

x(0) = 10. (3.60) 


Solution. The differential equation (3.59) is not separable but it is linear. Putting it 
in standard form, 


dx 3 
It + 2t + 50 X 


10 , 



3.3 RATE PROBLEMS 97 


we find the integrating factor 


exp 


fs-M 


2t -f 50 

Multiplying through by this, we have 

dx 


dt ) = (2 r + 50) 3/2 . 


(2 1 + 50) 3/2 — + 3(2 1 + 50) 1/2 x = 10(2t + 50) 3/2 
dt 


or 


Thus 


or 


dt 


[(2 1 + 50) 3/2 x] = 10(2 1 + 50) 


3/2 


(It + 50) 3/2 x = 2(2 1 + 50) 5/2 + c 


x = 4 (r + 25) + • 


(2t -f 50) 3/2 ’ 

Applying condition (3.60), x = 10 at t = 0, we find 

c 


10 = 100 + 


(50) 


3/2 


or 


c = — (90)(50) 3/2 = -22,500^/2. 
Thus the amount of salt at any time t > 0 is given by 

22,500^/2 

I = 41+1 °°~ (2, + 5o7 ' 


Exercises 

1. Assume that the rate at which radioactive nuclei decay is proportional to the 
number of such nuclei that are present in a given sample. In a certain sample 10% 
of the original number of radioactive nuclei have undergone disintegration in a 
period of 100 years. 

(a) What percentage of the original radioactive nuclei will remain after 1000 
years? 

(b) In how many years will only one-fourth of the original number remain? 

2. A certain chemical is converted into another chemical by a chemical reaction. The 
rate at which the first chemical is converted is proportional to the amount of this 
chemical present at any instant. Ten percent of the original amount of the first 
chemical has been converted in 5 min. 

(a) What percent of the first chemical will have been converted in 20 min? 

(b) In how many minutes will 60% of the first chemical have been converted? 
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3. A chemical reaction converts a certain chemical into another chemical, and the 
rate at which the first chemical is converted is proportional to the amount of this 
chemical present at any time. At the end of one hour, 50 gm of the first chemical 
remain; while at the end of three hours, only 25 gm remain. 

(a) How many grams of the first chemical were present initially? 

(b) How many grams of the first chemical will remain at the end of five hours? 

(c) In how many hours will only 2 gm of the first chemical remain? 

4. A chemical reaction converts a certain chemical into another chemical, and the 
rate at which the first chemical is converted is proportional to the amount of this 
chemical present at any time. At the end of one hour, two-thirds kg of the first 
chemical remains, while at the end of four hours, only one-third kg remains. 

(a) What fraction of the first chemical remains at the end of seven hours? 

(b) When will only one-tenth of the first chemical remain? 

5. Assume that the population of a certain city increases at a rate proportional to the 
number of inhabitants at any time. If the population doubles in 40 years, in how 
many years will it triple? 

6. The population of the city of Bingville increases at a rate proportional to the 
number of its inhabitants present at any time t. If the population of Bingville was 
30,000 in 1970 and 35,000 in 1980, what will be the population of Bingville in 
1990? 

7. In a certain bacteria culture the rate of increase in the number of bacteria is 
proportional to the number present. 

(a) If the number triples in 5 hr, how many will be present in 10 hr? 

(b) When will the number present be 10 times the number initially present? 

8. Assume that the rate of change of the human population of the earth is pro- 
portional to the number of people on earth at any time, and suppose that this 
population is increasing at the rate of 2% per year. The 1979 World Almanac gives 
the 1978 world population estimate as 4,219 million; assume this figure is in fact 
correct. 

(a) Using this data, express the human population of the earth as a function of 
time. 

(b) According to the formula of part (a), what was the population of the earth in 
1950? The 1979 World Almanac gives the 1950 world population estimate as 
2,510 million. Assuming this estimate is very nearly correct, comment on the 
accuracy of the formula of part (a) in checking such past populations. 

(c) According to the formula of part (a), what will be the population of the earth 
in 2000? Does this seem reasonable? 

(d) According to the formula of part (a), what was the population of the earth in 
1900? The 1979 World Almanac gives the 1900 world population estimate as 
1,600 million. Assuming this estimate is very nearly correct, comment on the 
accuracy of the formula of part (a) in checking such past populations. 

(e) According to the formula of part (a), what will be the population of the earth 
in 2100? Does this seem reasonable? 
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9. The human population of a certain island satisfies the logistic law (3.50) with 
k = 0.03, X = 3(10) -8 , and time t measured in years. 

(a) If the population in 1980 is 200,000, find a formula for the population in 
future years. 

(b) According to the formula of part (a), what will be the population in the year 
2000 ? 

(c) What is the limiting value of the population as t -* oo? 

10. This is a general problem about the logistic law of growth. A population satisfies 
the logistic law (3.50) and has x 0 members at time t 0 . 

(a) Solve the differential equation (3.50) and thus express the population x as a 
function of t. 

(b) Show that as t -> oo, the population x approaches the limiting value k/X. 

(c) Show that dx/dt is increasing if x < k/ 2X and decreasing if x > k/2X. 

(d) Graph x as a function of t for t > t 0 . 

(e) Interpret the results of parts (b), (c), and (d). 

11. The human population of a certain small island would satisfy the logistic law 
(3.50), with k = 4 ^o, X = (10)~ 8 , and t measured in years, provided the annual 
emigration from the island is neglected. However, the fact is that every year 100 
people become disenchanted with island life and move from the island to the 
mainland. Modify the logistic differential equation (3.50) with the given k and X so 
as to include the stated annual emigration. Assuming that the population in 1980 
is 20,000, solve the resulting initial-value problem and thus find the pop- 
ulation of the island as a function of time. 

12. Under natural circumstances the population of mice on a certain island would 
increase at a rate proportional to the number of mice present at any time, 
provided the island had no cats. There were no cats on the island from the 
beginning of 1970 to the beginning of 1980, and during this time the mouse 
population doubled, reaching an all-time high of 100,000 at the beginning of 
1980. At this time the people of the island, alarmed by the increasing number of 
mice, imported a number of cats to kill the mice. If the indicated natural rate of 
increase of mice was thereafter offset by the work of the cats, who killed 1000 mice 
a month, how many mice remained at the beginning of 1981? 

13. An amount of invested money is said to draw interest compounded continuously if 
the amount of money increases at a rate proportional to the amount present. 
Suppose $1000 is invested and draws interest compounded continuously, where 
the annual interest rate is 6%. 

(a) How much money will be present 10 years after the original amount was 
invested? 

(b) How long will it take the original amount of money to double? 

14. Suppose a certain amount of money is invested and draws interest compounded 
continuously. 

(a) If the original amount doubles in two years, then what is the annual interest 
rate? 
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(b) If the original amount increases 50% in six months, then how long will it take 
the original amount to double? 

15. A tank initially contains 100 gal of brine in which there is dissolved 20 lb of salt. 
Starting at time t = 0, brine containing 3 lb of dissolved salt per gallon flows into 
the tank at the rate of 4 gal/min. The mixture is kept uniform by stirring and the 
well-stirred mixture simultaneously flows out of the tank at the same rate. 

(a) How much salt is in the tank at the end of 10 min? 

(b) When is there 160 lb of salt in the tank? 

16. A large tank initially contains 100 gal of brine in which 10 lb of salt is dissolved. 
Starting at t = 0, pure water flows into the tank at the rate of 5 gal/min. The 
mixture is kept uniform by stirring and the well-stirred mixture simultaneously 
flows out at the slower rate of 2 gal/min. 

(a) How much salt is in the tank at the end of 15 min and what is the 
concentration at that time? 

(b) If the capacity of the tank is 250 gal, what is the concentration at the instant 
the tank overflows? 

17. A tank initially contains 100 gal of pure water. Starting at t = 0, a brine 
containing 4 lb of salt per gallon flows into the tank at the rate of 5 gal/min. The 
mixture is kept uniform by stirring and the well-stirred mixture flows out at the 
slower rate of 3 gal/min. 

(a) How much salt is in the tank at the end of 20 min? 

(b) When is there 50 lb of salt in the tank? 

18. A large tank initially contains 200 gal of brine in which 15 lb of salt is dissolved. 
Starting at t = 0, brine containing 4 lb of salt per gallon flows into the tank at the 
rate of 3.5 gal/min. The mixture is kept uniform by stirring and the well-stirred 
mixture leaves the tank at the rate of 4 gal/min. 

(a) How much salt is in the tank at the end of one hour? 

(b) How much salt is in the tank when the tank contains only 50 gal of brine? 

19. A 500 liter tank initially contains 300 liters of fluid in which there is dissolved 
50 gm of a certain chemical. Fluid containing 30 gm per liter of the dissolved 
chemical flows into the tank at the rate of 4 liters/min. The mixture is kept 
uniform by stirring, and the stirred mixture simultaneously flows out at the rate of 
2.5 liters/min. How much of the chemical is in the tank at the instant it overflows? 

20. A 200 liter tank is initially full of fluid in which there is dissolved 40 gm of a 
certain chemical. Fluid containing 50 gm per liter of this chemical flows into the 
tank at the rate of 5 liters/min. The mixture is kept uniform by stirring, and the 
stirred mixture simultaneously flows out at the rate of 7 liters/min. How much of 
the chemical is in the tank when it is only half full? 

21. The air in a room whose volume is 10,000 cu ft tests 0.15% carbon dioxide. 
Starting at t = 0, outside air testing 0.05% carbon dioxide is admitted at the rate 
of 5000 cu ft/min. 

(a) What is the percentage of carbon dioxide in the air in the room after 3 min? 

(b) When does the air in the room test 0.1% carbon dioxide? 
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22. The air in a room 50 ft by 20 ft by 8 ft tests 0.2% carbon dioxide. Starting at t = 0, 
outside air testing 0.05% carbon dioxide is admitted to the room. How many 
cubic feet of this outside air must be admitted per minute in order that the air in 
the room test 0.1% at the end of 30 min? 

23. Newton’s law of cooling states that the rate at which a body cools is proportional 
to the difference between the temperature of the body and that of the medium in 
which it is situated. A body of temperature 80 °F is placed at time t = 0 in a 
medium the temperature of which is maintained at 50 °F. At the end of 5 min, the 
body has cooled to a temperature of 70 °F. 

(a) What is the temperature of the body at the end of 10 min? 

(b) When will the temperature of the body be 60 °F? 

24. A body cools from 60 °C to 50 °C in 15 min in air which is maintained to 30 °C. 
How long will it take this body to cool from 100 °C to 80 °C in air that is main- 
tained at 50 °C? Assume Newton’s law of cooling (Exercise 23). 

25. The rate at which a certain substance dissolves in water is proportional at the 
product of the amount undissolved and the difference c x — c 2 , where c x is the 
concentration in the saturated solution and c 2 is the concentration in the actual 
solution. If saturated, 50 gm of water would dissolve 20 gm of the substance. If 
10 gm of the substance is placed in 50 gm of water and half of the substance is 
then dissolved in 90 min, how much will be dissolved in 3 hr? 



CHAPTER FOUR 

Explicit Methods of Solving Higher-Order 
Linear Differential Equations 


The subject of ordinary linear differential equations is one of great theoretical and 
practical importance. Theoretically, the subject is one of simplicity and elegance. 
Practically, linear differential equations originate in a variety of applications to science 
and engineering. Fortunately many of the linear differential equations that thus occur 
are of a special type, linear with constant coefficients, for which explicit methods of 
solution are available. The main purpose of this chapter is to study certain of these 
methods. First, however, we need to consider certain basic theorems that will be used 
throughout the chapter. These theorems are stated and illustrated in Section 4.1, but 
proofs are omitted in this introductory section. By far the most important case is that of 
the second- order linear differential equation, and we shall explicitly consider and 
illustrate this case for each important concept and result presented. In the final section 
of the chapter we return to this fundamental theory and present theorems and proofs in 
this important special case. Proofs in the general case are given in Chapter 11. 


4.1 BASIC THEORY OF LINEAR DIFFERENTIAL EQUATIONS 
A. Definition and Basic Existence Theorem 
DEFINITION 


A linear ordinary differential equation of order n in the dependent variable y and the 
independent variable x is an equation that is in, or can be expressed in, the form 


d n y 




+ •" + + a„(x)y = F(x), (4.1) 
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where a 0 is not identically zero . We shall assume that a 0 ,a 1 ,...,a„ and F are continuous 
real functions on a real interval a <x< b and that a 0 (x) # 0 for any x on a < x < b. The 
right-hand member F(x) is called the nonhomogeneous term. If F is identically zero , 
Equation (4.1) reduces to 

d n y d nl y dy 

a ° (*) ^ + • • • + a n- iM ^ + a„(x)y = 0 (4.2) 

and is then called homogeneous. 

For n = 2, Equation (4.1) reduces to the second- order nonhomogeneous linear 
differential equation 

a ° ^ Jx 1 + a '' ^Jx + ar ^ y = < 43 ) 

and (4.2) reduces to the corresponding second-order homogeneous equation 

«o(*) ^ + a 2 (x)y = 0. (4.4) 

Here we assume that a 0 ,a l ,a 2 , and F are continuous real functions on a real interval 
a < x < b and that a 0 (x) # 0 for any x on a < x < b. 


► Example 4.1 


The equation 


dx : 


+ 3x 


dy 

dx 


+ x 3 y = e x 


is a linear ordinary differential equation of the second order. 


d 3 y , „d 2 y , , , 


► Example 4.2 

The equation 

A2 

dx 3 ' v dx 2 ' dx 
is a linear ordinary differential equation of the third order. 

We now state the basic existence theorem for initial-value problems associated with 
and nth-order linear ordinary differential equation: 


3 + X T32 + 3x TT-^ = sinx 


THEOREM 4.1 
Hypothesis 

1. Consider the nth-order linear differential equation 
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where a 0 , a x , . . . , a n and F are continuous real functions on a real interval a < x <b and 
a 0 (x) # 0 for any x on a < x < b. 

2. Let x 0 be any point of the interval a < x <b, and let c 0 ,c 1 ,. . . , c„ _ x be n arbitrary 
real constants. 

Conclusion. There exists a unique solution f of (4.1) such that 
f(X 0 ) = C 0 ,f'(x 0 ) = C 1 ,...,/ ( " _1) (x 0 ) = c„_ 1; 
and this solution is defined over the entire interval a < x < b. 

Suppose that we are considering an nth-order linear differential equation (4.1), the 
coefficients and nonhomogeneous term of which all possess the continuity require- 
ments set forth in Hypothesis 1 of Theorem 4. 1 on a certain interval of the x axis. Then, 
given any point x 0 of this interval and any n real numbers c 0 ,c l5 . . . ,c„_ 1? the theorem 
assures us that there is precisely one solution of the differential equation that assumes 
the value c 0 at x = x 0 and whose fcth derivative assumes the value c k for each k = 
l,2,...,n — 1 at x = x 0 . Further, the theorem asserts that this unique solution is 
defined for all x in the above-mentioned interval. 

For the second- order linear differential equation, 

«o (x) ~jf + a ^ x )~^ + a 2( x )y = F (x), (4-3) 

the requirements of Hypothesis 1 of Theorem 4.1 are that a 0 ,a l ,a 2 , and F be 
continuous on a real interval a < x <b and that a 0 (x) # 0 for any x on this interval. 
Then, if x 0 is any point of the interval a < x <b and c 0 and c x are any two real numbers, 
the theorem assures us that there is precisely one solution / of the second-order 
differential equation (4.3) which assumes the value c 0 at x = x 0 and whose first 
derivative assumes the value c x at x = x 0 : 

f(x 0 ) = c 0 , f(x 0 ) = c 1 . (4.5) 

Moreover, the theorem asserts that this unique solution / of Equation (4.3) which 
satisfies conditions (4.5) is defined for all x on the interval a < x < b. 


► Example 4.3 


Consider the initial-value problem 


d 2 y dy 3 

+ 3x-f + x 3 y = e x 

ax ax 


y( i) = 2, 

/(1)= -5. 

The coefficients 1, 3x, and x 3 , as well as the nonhomogeneous term e x , in this second- 
order differential equation are all continuous for all values of x, — oo < x < oo. The 
point x 0 here is the point 1, which certainly belongs to this interval; and the real 
numbers c 0 and c x are 2 and -5, respectively. Thus Theorem 4.1 assures us that a 
solution of the given problem exists, is unique, and is defined for all x, — oo < x < oo. 
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► Example 4.4 


Consider the initial-value problem 


A 3 ) 


d 2 


y , ,„ 2 d y 


2 ^ + X "j ^ +3x “ - - 5? = sin «, 


dx 


y( 4) = 3, 

/(4) = 5, 

/'(4) = -f 

Here we have a third-order problem. The coefficients 2,x, 3x 2 , and —5, as well as the 
nonhomogeneous term sin x, are all continuous for all x, — oo < x < oo. The point 
x 0 = 4 certainly belongs to this interval; the real numbers c 0 ,c l5 and c 2 in this prob- 
lem are 3, 5, and — respectively. Theorem 4.1 assures us that this problem also has 
a unique solution which is defined for all x, — oo < x < oo. 

A useful corollary to Theorem 4.1 is the following: 


COROLLARY 

Hypothesis. Let f be a solution of the nth-order homogeneous linear differential 
equation 

a 0 {x) + a^x) + ••• + «„- i(x) ^ + a n (x)y = 0 (4.2) 

such that 

f(x 0 ) = 0,/'(x o ) = 0,...,/ < " _1, (x o ) = 0, 

where x 0 is a point of the interval a < x < b in which the coefficients a 0 ,a l9 ...,a„ are 
all continuous and a 0 (x) / 0. 

Conclusion. Then /(x) = 0 for all x on a < x < b. 

Let us suppose that we are considering a homogeneous equation of the form (4.2), all 
the coefficients of which are continuous on a certain interval of the x axis. Suppose 
further that we have a solution / of this equation which is such that / and its first n — 1 
derivatives all equal zero at a point x 0 of this interval. Then this corollary states that 
this solution is the “trivial” solution / such that /(x) = 0 for all x on the above- 
mentioned interval. 


► Example 4.5 

The unique solution / of the third-order homogeneous equation 
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which is such that 

/(2) = /'(2) = f" 2) = 0, 

is the trivial solution / such that f(x) = 0 for all x. 

B. The Homogeneous Equation 

We now consider the fundamental results concerning the homogeneous equation 

d n v d nl v dv 

a o(*) dx" + a ‘ (x ^ dx + " ■ + ^ + a n( x )y = 0. (4.2) 

We first state the following basic theorem: 


THEOREM 4.2 BASIC THEOREM ON LINEAR HOMOGENEOUS 
DIFFERENTIAL EQUATIONS 


Hypothesis. Let /i,/ 2 ,...,/ m be any m solutions of the homogeneous linear differ- 
ential equation (4.2). 

Conclusion. Then c l f 1 -f c 2 f 2 + ' * * + c m f m is also a solution of (4.2), where 
c l ,c 2 ,...,c m are m arbitrary constants. 

Theorem 4.2 states that if m known solutions of (4.2) are each multiplied by an 
arbitrary constant and the resulting products are then added together, the resulting 
sum is also a solution of (4.2). We may put this theorem in a very simple form by means 
of the concept of linear combination, which we now introduce. 


DEFINITION 

If fi , f 2 , . . , f m ar e m given functions, and c 1 ,c 2 ,...,c m are m constants, then the ex- 
pression 


C \f\ + C lfl + " ’ + C mfm 
is called a linear combination of f x , f 2 , . . . , f m • 

In terms of this concept, Theorem 4.2 may be stated as follows: 


THEOREM 4.2 (RESTATED) 

Any linear combination of solutions of the homogeneous linear differential equation (4.2) 
is also a solution of (4.2). 

In particular, any linear combination 


C lfl + C 2fl + *” + C mfm 
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of m solutions /i,/ 2 > •••,/», °f the second - order homogeneous linear differential 
equation 


a 0 (x) 


V dv 

~2 + + a 2( x )y = 0 


(4.4) 


is also a solution of (4.4). 

► Example 4.6 

The student will readily verify that sin x and cos x are solutions of 

d 2 y 

d^ + y = 0 ' 

Theorem 4.2 states that the linear combination c x sin x -f c 2 cos x is also a solution for 
any constants c x and c 2 . For example, the particular linear combination 

5 sin x + 6 cos x 


is a solution. 


► Example 4.7 


The student may verify that e x , e x , and e 2x are solutions of 


d 3 y ~ d 2 y 


~t^ = O' 


Theorem 4.2 states that the linear combination c x e x -f c 2 e x + c 3 e lx is also a solution 
for any constants c l9 c 2 , and c 3 . For example, the particular linear combination 


2e x — 3e x + \e 2x 


is a solution. 


We now consider what constitutes the so-called general solution of (4.2). To 
understand this we first introduce the concepts of linear dependence and linear 
independence. 


DEFINITION 

The n functions fi,f 2 ,--">f n are called linearly dependent on a < x < b if there exist 
constants c 1 ,c 2 ,...,c„, not all zero, such that 

Cl fi (x) + C 2 f 2 (x) + ■■■ + c„/„(x) = 0 

for all x such that a < x < b. 

In particular, two functions f { andf 2 are linearly dependent on a < x < bif there exist 
constants c 1? c 2 , not both zero, such that 

cji(x) + c 2 f 2 (x) = 0 


for all x such that a < x < b. 
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► Example 4.8 

We observe that x and 2x are linearly dependent on the interval 0 < x < 1. For there 
exist constants and c 2 , not both zero , such that 

Cix + c 2 (2x) = 0 

for all x on the interval 0 < x < 1. For example, let c x = 2, c 2 = — 1. 


► Example 4.9 

We observe that sin x, 3 sin x, and —sin x are linearly dependent on the interval 
— 1 < x < 2. For there exist constants c t ,c 2 ,c 3 , not all zero , such that 

c x sin x + c 2 (3 sin x) + c 3 ( — sin x) = 0 

for all x on the interval — 1 < x < 2. For example, let c x = l,c 2 = l,c 3 = 4. 


DEFINITION 


The n functions f x , f 2 ,...,/„ are called linearly independent on the interval a < x <bif 
they are not linearly dependent there. That is, the functions /i , / 2 , • . . , are linearly 
independent on a < x < b if the relation 

cJi(x) + C 2 f 2 {x) + ••• + cj„(x) = 0 

for all x such that a < x < b implies that 

c 1= c 2 ="=c„ = 0. 

In other words, the only linear combination of f,f 2 , ■■■,/„ that is identically zero on 
a < x < b is the trivial linear combination 

0 • /i + 0 * f 2 + ••• + 0 • /„. 

In particular, two functions f x and f 2 are linearly independent on a < x < b if the 
relation 


C i/iW + c 2 / 2 (x) = 0 
for all x on a < x < b implies that 

c i = c 2 = 0 . 


► Example 4.10 

We assert that x and x 2 are linearly independent on 0 < x < 1, since c!x + c 2 x 2 = 0 
for all x on 0 < x < 1 implies that both c x = 0 and c 2 = 0. We may verify this in 
the following way. We differentiate both sides of c x x + c 2 x 2 = 0 to obtain c x + 
2c 2 x = 0, which must also hold for all x on 0 < x < 1. Then from this we also have 
c x x + 2c 2 x 2 = 0 for all such x. Thus we have both 

C!X + c 2 x 2 = 0 and c x x + 2c 2 x 2 = 0 


(4.6) 
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for all x on 0 < x < 1. Subtracting the first from the second gives c 2 x 2 = 0 for all x on 
0 < x < 1, which at once implies c 2 = 0. Then either of (4.6) show similarly that c x = 0. 

The next theorem is concerned with the existence of sets of linearly independent 
solutions of an nth-order homogeneous linear differential equation and with the 
significance of such linearly independent sets. 


THEOREM 4.3 


The nth-order homogeneous linear differential equation 

d n v d nl v dv 

+ g i(*) <fcc „-i + + a «-i(x)j^ + a„(x)y = 0 (4.2) 

always possesses n solutions that are linearly independent. Further , if /i,/ 2 , . . . ,/„ are n 
linearly independent solutions of (4.2), then every solution f of (4.2) can be expressed as a 
linear combination 


C l/l + C lfl + ••• + C nfn 

of these n linearly independent solutions by proper choice of the constants c x ,c 2 ,. . . ,c n . 

Given an nth-order homogeneous linear differential equation, this theorem assures 
us first that a set of n linearly independent solutions actually exists. The existence of 
such a linearly independent set assured, the theorem goes on to tell us that any solution 
whatsoever of (4.2) can be written as a linear combination of such a linearly inde- 
pendent set of n solutions by suitable choice of the constants c l5 c 2 , . . . , c n . 

For the second- order homogeneous linear differential equation 

a 0 (x) + a i M + a 2 (x)y = 0, (4.4) 

Theorem 4.3 first assures us that a set of two linearly independent solutions exists. The 
existence of such a linearly independent set assured, let f t and f 2 be a set of two linearly 
independent solutions. Then if / is any solution of (4.4), the theorem also assures us 
that / can be expressed as a linear combination c l f l + c 2 f 2 of the two linearly 
independent solutions f x and f 2 by proper choice of the constants c 1 and c 2 . 


► Example 4.1 1 

We have observed that sin x and cos x are solutions of 

d 2 v 

+ < 47 > 

for all x, — oo < x < oo. Further, one can show that these two solutions are linearly 
independent. Now suppose / is any solution of (4.7). Then by Theorem 4.3 / can be 
expressed as a certain linear combination c x sin x + c 2 cos x of the two linearly 
independent solutions sin x and cos x by proper choice of c x and c 2 . That is, there exist 
two particular constants c x and c 2 such that 

f(x) = c x sin x + c 2 cos x 


(4.8) 
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for all x, — oo < x < oo. For example, one can easily verify that/(x) = sin(x + 7r/6) is a 
solution of Equation (4.7). Since 

. / 71 \ 71 .71 x /3 . 1 

sin x + — = sin x cos — + cos x sin — = sin x + - cos x, 

V 6 / 6 6 2 2 

we see that the solution sin(x + n/6) can be expressed as the linear combination 

73 . ! 

sin x 4- icos x 

of the two linearly independent solutions sin x and cos x. Note that this is of the form in 
the right member of (4.8) with c x = y/i/2 and c 2 = \. 

Now let /i,/ 2 ,...,/„ be a set of n linearly independent solutions of (4.2). Then by 
Theorem 4.2 we know that the linear combination 


c i/i + c ifi + ••* + c n f n , (4.9) 

where c 1 ,c 2 ,...,c #J are n arbitrary constants, is also a solution of (4.2). On the other 
hand, by Theorem 4.3 we know that if / is any solution of (4.2), then it can be expressed 
as a linear combination (4.9) of the n linearly independent solutions /i , / 2 by a 

suitable choice of the constants c l9 c 2 ,...,c„. Thus a linear combination (4.9) of the 
n linearly independent solutions /i,/ 2 ,...,/„ in which c 1 ,c 2 ,...,c„ are arbitrary 
constants must include all solutions of (4.2). For this reason, we refer to a set of n 
linearly independent solutions of (4.2) as a “fundamental set” of (4.2) and call a 
“general” linear combination of n linearly independent solutions a “general solution” 
of (4.2), in accordance with the following definition: 


DEFINITION 


If /i , f 2 , • . . , f n are n linearly independent solutions of the nth-order homogeneous linear 
differential equation 

d n v d nl v dv 

a °^ dx " + ai ^ dx + ' ' ■ + a »- iW fa + a n( x )y = 0 ( 4 - 2 ) 

on a < x < b, then the set f x , f 2 , . . . , /„ is called a fundamental set of solutions of (4.2) 
and the function f defined by 

fix) = Ci f i (x) + c 2 / 2 (x) + •*• + c„f n (x), a < x < b, 

where c 1 ,c 2 ,...,c„ are arbitrary constants , is called a general solution of (4.2) on 
a < x <b. 

Therefore, if we can find n linearly independent solutions of (4.2), we can at once 
write the general solution of (4.2) as a general linear combination of these n solutions. 
For the second- order homogeneous linear differential equation 

a 0 (x) ^T + a iW^ + a 2 (x)y = 0, (4.4) 

a fundamental set consists of two linearly independent solutions. If f { and f 2 are a 
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fundamental set of (4.4) on a < x < b, then a general solution of (4.4) on a < x < b is 
defined by 

Cifi(x) + c 2 / 2 (x), a<x<b , 

where c x and c 2 are arbitrary constants. 


► Example 4.12 


We have observed that sin x and cos x are solutions of 


d 2 y 

ds +y =° 


for all x, — oo < x < oo. Further, one can show that these two solutions are linearly 
independent. Thus, they constitute a fundamental set of solutions of the given 
differential equation, and its general solution may be expressed as the linear 
combination 


Cj sin x -f c 2 cos x, 

where c { and c 2 are arbitrary constants. We write this as y = c x sin x + c 2 cos x. 


► Example 4.1 3 


The solutions e x , e x , and e 2x of 


<C_y 2 <P_y dy 
dx 3 dx 2 dx 


+ 2y = 0 


may be shown to be linearly independent for all x, — oo < x < oo.Thus,e x , ^“ x ,and^ 2x 
constitute a fundamental set of the given differential equation, and its general solution 
may be expressed as the linear combination 

c x e x + c 2 e~ x + c 3 e 2x , 

where c 1? c 2 , and c 3 are arbitrary constants. We write this as 

y = c x e x + c 2 e~ x + c 3 e lx . 

The next theorem gives a simple criterion for determining whether or not n solutions 
of (4.2) are linearly independent. We first introduce another concept. 


DEFINITION 


Let fuf 2 >---,f n b e n real functions each of which has an (n — 1 )st derivative on a real 
interval a < x < b. The determinant 


W(fuf 2 ,-- Jn) = 


fl 
r i 


fi 

fi 


f (n-V f { ”~ l) 


fn 

f'n 


fin- 1 ) 
J n 
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in which primes denote derivatives , is called the Wronskian of these n functions. We 
observe that W(f i,/ 2 , is itself a real function defined on a < x < b. Its value at x 

is denoted by W(f u f 2 ,...J n )(x) or by W[/i(x),/ 2 (x),...,/„(x)]. 


THEOREM 4.4 

The n solutions fi,f 2 ,..-,f n of the nth-order homogeneous linear differential equation 
(4.2) are linearly independent on a < x < b if and only if the Wronskian of / 1? / 2 , • • • ,/„ 
is different from zero for some x on the interval a < x < b. 

We have further: 


THEOREM 4.5 


The Wronskian of n solutions fuf 2 ,---,f n of (4.2) is either identically zero on a < x < b 
or else is never zero on a < x < b. 


Thus if we can find n solutions of (4.2), we can apply the Theorems 4.4. and 4.5 to 
determine whether or not they are linearly independent. If they are linearly inde- 
pendent, then we can form the general solution as a linear combination of these n 
linearly independent solutions. 

In the case of the general second-order homogeneous linear differential equation 


d 2 y 

+ Mr + a i(x)y = o, 




(4.4) 


the Wronskian of two solutions f x and f 2 is the second-order determinant 


W(f u f 2 ) 


fx 

fx 


f 2 

fi 


— f\fl ~ f\fi • 


By Theorem 4.4, two solutions f x and f 2 of (4.4) are linearly independent on a < x < b 
if and only if their Wronskian is different from zero for some x on a < x < b; and by 
Theorem 4.5, this Wronskian is either always zero or never zero on a < x < b. Thus if 
on a<x<b, solutions f x and f 2 of (4.4) are linearly independent 
on a < x < b and the general solution of (4.4) can be written as the linear combination 


+ C 2 f 2 (x), 


where c x and c 2 are arbitrary constants. 


► Example 4.14 


We apply Theorem 4.4 to show that the solutions sin x and cos x of 


d 2 y 

dx 2 


+ y = 0 
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are linearly independent. We find that 


W(sin x, cos x) 


sin x cos x 
cos x — sin x 


— sin 2 x — cos 2 x = — 1 # 0 


for all real x. Thus, since JT(sin x, cos x) # 0 for all real x, we conclude that sin x and 
cos x are indeed linearly independent solutions of the given differential equation on 
every real interval. 


► Example 4.1 5 

The solutions e x , e ~ x , and e 2x of 

d 3 y d 2 y dy 

are linearly independent on every real interval, for 


W(e x ,e' -,e 
for all real x. 


2x 


) = 


e x 

e~ x 

e 2x 


i 

1 

i 

e x 

— e~ x 

2e lx 

= e 2x 

i 

-1 

2 

e x 

e~ x 

4e 2x 


i 

1 

4 


= —6e 2x ?= 0 


Exercises 

1. Theorem 4.1 applies to one of the following problems but not to the other. 
Determine to which of the problems the theorem applies and state precisely the 
conclusion which can be drawn in this case. Explain why the theorem does not 
apply to the remaining problem. 

(a) lx 1 + 5 % + 6y = = 5 ’ y ^ = 7 ' 

(b) u + 5 l +6y - e *- m = 5 - m=1 

2. Answer orally: What is the solution of the following initial-value problem? Why? 

g + ^ + x.,, 0 , XD-o, /(i) = o. 

3. Prove Theorem 4.2 for the case m = n = 2. That is, prove that if f { (x) and / 2 (x) 
are two solutions of 


d 2 y dy 

a 0 (x) ^2 + a l W fa + a 2 Wy = 

then c l f l (x) + c 2 / 2 (x) is also a solution of this equation, where c x and c 2 are 
arbitrary constants. 
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4. Consider the differential equation 


d 2 y dy 
d^~ 4 dx + y 


0 . 


(A) 


(a) Show that each of the functions e x and e 3x is a solution of differential 
equation (A) on the interval a < x < b, where a and b are arbitrary real 
numbers such that a < b. 

(b) What theorem enables us to conclude at once that each of the functions 

5e x -f 2e 3x , 6e x - 4e 3x , and -le x + 5e 3x 

is also a solution of differential equation (A) on a < x < bl 

(c) Each of the functions 

3e x , —4e x , 5e x , and 6e x 
is also a solution of differential equation (A) on a < x < b. Why? 

5. Again consider the differential equation (A) of Exercise 4. 

(a) Use the definition of linear dependence to show that the four functions of 
part (c) of Exercise 4 are linearly dependent on a < x < b. 

(b) Use Theorem 4.4 to show that each pair of the four solutions of differential 
equation (A) listed in part (c) of Exercise 4 are linearly dependent on 
a < x < b. 


6. Again consider the differential equation (A) of Exercise 4. 

(a) Use the definition of linear independence to show that the two functions e x 
and e 3x are linearly independent on a < x < b. 

(b) Use Theorem 4.4 to show that the two solutions e x and e 3x of differential 
equation (A) are linearly independent on a < x < b. 


7. Consider the differential equation 


d 2 y 

dx 2 


5 ^ + 6y = 0. 
dx 


(a) Show that e 2x and e 3x are linearly independent solutions of this equation on 
the interval — oo < x < oo. 

(b) Write the general solution of the given equation. 

(c) Find the solution that satisfies the conditions y(0) = 2, /(0) = 3. Explain 
why this solution is unique. Over what interval is it defined? 

8. Consider the differential equation 


dx 2 dx 


+ y = 0. 


(a) Show that e x and xe x are linearly independent solutions of this equation on 
the interval — oo < x < oo. 

(b) Write the general solution of the given equation. 

(c) Find the solution that satisfies the condition y(0) = 1, y'(0) = 4. Explain why 
this solution is unique. Over what interval is it defined? 
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9. Consider the differential equation 


d^y 

dx 2 


2x^ + 2, 

dx 


0 . 


(a) Show that x and x 2 are linearly independent solutions of this equation on 
the interval 0 < x < oo. 

(b) Write the general solution of the given equation. 

(c) Find the solution that satisfies the conditions y(l) = 3, y'(l) = 2. Explain 
why this solution is unique. Over what interval is this solution defined? 


10. Consider the differential equation 


* 2 ^ + * $ - ^ 


dx 


dx 


(a) Show that x 2 and 1 /x 2 are linearly independent solutions of this equation on 
the interval 0 < x < oo. 

(b) Write the general solution of the given equation. 

(c) Find the solution that satisfies the conditions y(2) = 3, y'(2) = — 1. Explain 
why this solution is unique. Over what interval is this solution defined? 


11. Consider the differential equation 


d^y , dy 
dx 2 dx 


+ 4y = 0. 


(a) Show that each of the functions e x , e 4x , and 2e x — 3e 4x is a solution of this 
equation on the interval — oo < x < oo. 

(b) Show that the solutions e x and e 4x are linearly independent on - oo < 

X < oo. 

(c) Show that the solutions e x and 2e x — 3e 4x are also linearly independent on 

— OO < X < oo. 

(d) Are the solutions e 4x and 2e x — 3e 4x still another pair of linearly indepen- 
dent solutions on — oo < x < oo? Justify your answer. 

12. Given that e~ x , e 3x , and e 4x are all solutions of 


d^y d*y dy 

dx 3 dx 2 dx 


+ 12 y = 0, 


show that they are linearly independent on the interval — oo < x < oo and write 
the general solution. 


13. Given that x, x 2 , and x 4 are all solutions of 


dx dx dx 


show that they are linearly independent on the interval 0 < x < oo and write the 
general solution. 
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C. Reduction of Order 

In Section 4.2 we shall begin to study methods for obtaining explicit solutions of 
higher-order linear differential equations. There and in later sections we shall find that 
the following theorem on reduction of order is often quite useful. 


THEOREM 4.6 


Hypothesis. Let f be a nontrivial solution of the nth-order homogeneous linear dif- 
ferential equation 


d n y d n 1 


dy 

+ ••• + a„_,(x)— + a„(x)y = 0. 


(4.2) 


Conclusion. The transformation y = f(x)v reduces Equation (4.2) to an (n — 1 )st- 
order homogeneous linear differential equation in the dependent variable w = dv/dx. 


This theorem states that if one nonzero solution of the nth-order homogeneous lin- 
ear differential equation (4.2) is known, then by making the appropriate transforma- 
tion we may reduce the given equation to another homogeneous linear equation that is 
one order lower than the original. Since this theorem will be most useful for us in 
connection with second-order homogeneous linear equations (the case where n = 2), 
we shall now investigate the second-order case in detail. Suppose / is a known 
nontrivial solution of the second-order homogeneous linear equation 


d 2 y dy 

a o (*) ^2 + a i M fa + a 2 (*)y = °- 


(4.10) 


Let us make the transformation 


y = f(x)v. 


(4.11) 


where / is the known solution of (4.10) and v is a function of x that will be determined. 
Then, differentiating, we obtain 


*- /w s +/,(xK (4 - i2) 

d 2 v d 2 v dv 

_^ = / M _ + 2/W _ + rWR (4 , 3) 


Substituting (4.11), (4.12), and (4.13) into (4.10), we obtain 



d 2 v dv 

^ r dv „ , x 

a 0 (x) 

[/(x)^ + 2/'(x)- + /"(x),J 

+ «l(x) f(x)— + f(x)v 


+ a 2 (x)f(x)v = 0 


or 


+ La 0 (x)f"(x) + a l (x)f'(x) + a 2 (x)/(x)]u = 0. 
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Since / is a solution of (4. 10), the coefficient of v is zero, and so the last equation reduces 
to 


Letting w = dv/dx , this becomes 

dw 


a 0 (x)f(x)~ + [2 a 0 (x)f'(x) + afix)f(x)]w = 0. 


(4.14) 


This is a first-order homogeneous linear differential equation in the dependent variable 
w. The equation is separable; thus assuming f(x) ^ 0 and a 0 (x) ^ 0, we may write 


dw 

w 


L fix) «oWJ 


Thus integrating, we obtain 

In | w| = — ln[/(x)] 2 - 


Qi(x) 

a 0 (x) 


dx + In I c I 


or 


r r«i(x) 

CCXP [ Jflo(x) 

W = _ . — 


dx 


[/(x)] 2 

This is the general solution of Equation (4.14); choosing the particular solution for 
which c = 1, recalling that dv/dx = w, and integrating again, we now obtain 

i(x) 


v = 


fa i(x) 
ex P _ — r - ; 
J a 0 (x) 


dx 


imv 


dx. 


Finally, from (4.1 1), we obtain 


y = fix) 




r f fli(x) 

L J «o(x) 


dx 


dx. 


(4.15) 


[/(x)] 2 

The function defined in the right member of (4. 1 5), which we shall henceforth denote by 
g, is actually a solution of the original second-order equation (4.10). Furthermore, this 
new solution g and the original known solution / are linearly independent, since 

I fix) g(x) 


= [/(x)] V = exp 
Thus the linear combination 


fix) f(x)v 

fix) f(x)v' + f'(x)v 

a fix) 


a 0 (x) 


dx 


# 0 . 


cj+ c 2 g 

is the general solution of Equation (4.10). We now summarize this discussion in the 
following theorem. 
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THEOREM 4.7 


Hypothesis. Let f be a nontrivial solution of the second-order homogeneous 
linear differential equation 


d 2 


dy 


a oM Jr + M tt + a 2 toy = °- 


dx 


dx 


(4.10) 


Conclusion 1. The transformation y = f(x)v reduces Equation (4.10) to the first- 
order homogeneous linear differential equation 

dw 

a 0 (x)f(x) — + [2 a 0 (x)f'{x) + a, (x)/(x)]w = 0 (4.14) 

in the dependent variable w, where w = dv/dx. 


Conclusion 2. The particular solution 



of Equation (4.14) gives rise to the function v, where 



The function g defined by g(x) = f(x)v(x) is then a solution of the second-order equation 
(4.10). 


Conclusion 3. The original known solution f and the “new” solution g are linearly 
independent solutions of (4.10), and hence the general solution of (4.10) may be expressed 
as the linear combination 


cj+c 2 g. 


Let us emphasize the utility of this theorem and at the same time clearly recognize its 
limitations. Certainly its utility is by now obvious. It tells us that if one solution of the 
second-order equation (4. 10) is known, then we can reduce the order to obtain a linearly 
independent solution and thereby obtain the general solution of (4.10). But the 
limitations of the theorem are equally obvious. One solution of Equation (4.10) must 
already be known to us in order to apply the theorem. How does one “already know” a 
solution? In general one does not. In some cases the form of the equation itself or 
related physical considerations suggest that there may be a solution of a certain special 
form: for example, an exponential solution or a linear solution. However, such cases are 
not too common and if no solution at all can be so ascertained, then the theorem will 
not aid us. 

We now illustrate the method of reduction of order by means of the following 
example. 
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► Example 4.16 

Given that y = x is a solution of 


( x ’ + i ) 0 - 2x £ +2>,=o - ,4i6 » 

find a linearly independent solution by reducing the order. 

Solution. First observe that y = x does satisfy Equation (4.16). Then let 

y = xv. 

Then 


dy dv 

dx dx 


+ v 



X 


d 2 v dv 
dx 2 + dx 


Substituting the expressions for y, dy/dx , and d 2 y/dx 2 into Equation (4.16), we obtain 


d 2 v 


dv 


(* + *) + 2 -r: 


dx 


dx 


dv 

x — + v ) + 2xv = 0 
dx 


or 


d 2 v 


dv 


x(x2 + 1 W + 2 s= a 


Letting w = dv/dx we obtain the first- order homogeneous linear equation 

dw 

x(x 2 + 1)— + 2w = 0. 
dx 


Treating this as a separable equation, we obtain 

dw 2 dx 

w x(x 2 + 1) 


or 


dw 

w 




dx. 


Integrating, we obtain the general solution 

c(x 2 + 1) 
w = = 


Choosing c = 1, we recall that dv/dx = w and integrate to obtain the function v given 
by 

v(x) = x — — . 

X 


Now forming g = fv, where f(x) denotes the known solution x, we obtain the function g 
defined by 


g{x) = x x - 


1 


= x 2 - 1. 
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By Theorem 4.7 we know that this is the desired linearly independent solution. The 
general solution of Equation (4.16) may thus be expressed as the linear combination 
c iX + c 2 (x 2 — 1) of the linearly independent solutions / and g. We thus write the 
general solution of Equation (4.16) as 

y = c,x + c 2 (x 2 - l). 


D. The Nonhomogeneous Equation 

We now return briefly to the nonhomogeneous equation 

d n v d n ~ l y dy 

a °^dx" + a '^dx irrT + + a "~ 1 ^dx + a "^ y = 

The basic theorem dealing with this equation is the following. 


THEOREM 4.8 
Hypothesis 

(7) Let v be any solution of the given (nonhomogeneous) nth-order linear differential 
equation (4.1). (2) Let u be any solution of the corresponding homogeneous equation 

d n v d nl v dv 

a°W^ + aiW^ r rV + •• + a„-j(x)-^ + a„(x)y = 0. (4.2) 

Conclusion. Then u -f v is also a solution of the given (nonhomogeneous) equation 
(4.1). 


► Example 4.1 7 

Observe that y = x is a solution of the nonhomogeneous equation 

d 2 y 
dx 2 


-jzi+y = x - 


and that y = sin x is a solution of the corresponding homogeneous equation 

d 2 y 


Then by Theorem 4.8 the sum 

sin x + x 

is also a solution of the given nonhomogeneous equation 

d 2 y 
dx 2 

The student should check that this is indeed true. 


TT + = *• 
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Now let us apply Theorem 4.8 in the case where v is a given solution y p of the 
nonhomogeneous equation (4.1) involving no arbitrary constants, and u is the general 
solution 


y c = c l y l + c 2 y 2 + * * * + c n y n 

of the corresponding homogeneous equation (4.2). Then by this theorem, 


y c + y P 

is also a solution of the nonhomogeneous equation (4. 1 ), and it is a solution involving n 
arbitrary constants c 1? c 2 ,..., c n . Concerning the significance of such a solution, we 
now state the following result. 


THEOREM 4.9 
Hypothesis 

(7) Let y p be a given solution of the nth-order nonhomogeneous linear equation (4.1) 
involving no arbitrary constants. (2) Let 

y c = c i y i +c 2 y 2 + ••• + c H y H 

be the general solution of the corresponding homogeneous equation (4.2). 

Conclusion. Then every solution 0 of the nth-order nonhomogeneous equation (4.1) 
can be expressed in the form 

y c + y P , 

that is, 


c iyi + c 2y2 + -- - + c n y n + y P 

for suitable choice of the n arbitrary constants c i ,c 2 ,...,c n . 

This result suggests that we call a solution of Equation (4.1) of the form y c + y P ,a 
general solution of (4.1), in accordance with the following definition: 


DEFINITION 

Consider the nth-order (nonhomogeneous) linear differential equation 

d n v d n ~ l v dv 

a °^dx" + ai ^dx irT+ + a "~ 1 ^dx + a "^ y = ( 41 > 

and the corresponding homogeneous equation 

d n v d n ~^v dv 

a °( X ) ~dx n dx n ~~* + * * * ~*~ a n-i( x ) ^ + a m (x)y = (4-2) 

1. The general solution of (4.2) is called the complementary function of Equation (4.1). 
We shall denote this by y c . 
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2. Any particular solution of (4.1) involving no arbitrary constants is called a particular 
integral of (4.1). We shall denote this by y p . 

3. The solution y c + y p of (4.1), where y c is the complementary function and y p is a 
particular integral of (4.1), is called the general solution of (4.1). 

Thus to find the general solution of (4.1), we need merely find: 

1. The complementary function, that is, a “general” linear combination of n linearly 
independent solutions of the corresponding homogeneous equation (4.2); and 

2. A particular integral, that is, any particular solution of (4.1) involving no arbitrary 
constants. 


► Example 4.18 

Consider the differential equation 


d*y 

dx 


j + y = x - 


The complementary function is the general solution 

y c = Ci sin x + c 2 cos x 
of the corresponding homogeneous equation 


d 2 y 

dx 2 


XT + ^ = 0. 


A particular integral is given by 


yp = x. 

Thus the general solution of the given equation may be written 
y = y c + y p = c x sin x -f c 2 cos x -f x. 

In the remaining sections of this chapter we shall proceed to study methods of 
obtaining the two constituent parts of the general solution. 

We point out that if the nonhomogeneous member F(x) of the linear differential 
equation (4.1) is expressed as a linear combination of two or more functions, then the 
following theorem may often be used to advantage in finding a particular integral. 


THEOREM 4.10 


Hypothesis 

1. Let j\ be a particular integral of 


/ x d n y d n l y 

+ M£TT + 


dy 

+ a "~ 1 ^ dx + a "^ y = Fl ^' 


(4.17) 
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2. Let f 2 be a particular integral of 


a 0 (x) 


d n y 

dx n 


+ fliM 


d n ~ l y 

dx n ~ l 


dy 

+ ••• + a n _fx)— + a n {x)y = F 2 (x). 


(4.18) 


Conclusion. Then k l f 1 + k 2 f 2 is a particular integral of 


d n y 


dy 


«o(x)^ + «iW^rr+ ••• + — + a n (x)y = fcjF^x) + fc 2 F 2 (x), 


dx 


dx 


(4.19) 


w/iere k t and k 2 are constants . 


► Example 4.19 


Suppose we seek a particular integral of 


d 2 y 

dx 2 


■ 3 -j + 3 / = 3x + 5 tan x. 


We may then consider the two equations 

d 2 y 


dx 


2 + y = x 


and 


d 2 y 


dx 


2 + y = tan x. 


(4.20) 


(4.21) 


(4.22) 


We have already noted in Example 4.18 that a particular integral of Equation (4.21) is 
given by 


y = x. 

Further, we can verify (by direct substitution) that a particular integral of Equation 
(4.22) is given by 


y = —(cos x)ln |sec x + tan x|. 

Therefore, applying Theorem 4.10, a particular integral of Equation (4.22) is 
y = 3x — 5(cos x)ln |sec x + tan x|. 

This example makes the utility of Theorem 4. 1 0 apparent. The particular integral y = x 
of (4.21) can be quickly determined by the method of Section 4.3 (or by direct 
inspection!), whereas the particular integral 

y = —(cos x)ln |sec x + tan x| 

of (4.22) must be determined by the method of Section 4.4, and this requires 
considerably greater computation. 
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Exercises 


1. Given that y = x is a solution of 

7 d 2 y A dy 
x 2 -3—=- - 4x — + Ay = 0, 
ax ax 

find a linearly independent solution by reducing the order. Write the general 
solution. 

2. Given that y = x + 1 is a solution of 

{x + l) 2 i^~ 3(x + l) ^ + 3y = 0 ’ 

find a linearly independent solution by reducing the order. Write the general 
solution. 


3. Given that y = x is a solution of 

.2 <J 2 y 


(x 2 - 1 ) ^-- 2 x^+ 2 y = °, 


dy_ 

dx 


find a linearly independent solution by reducing the order. Write the general 
solution. 

4. Given that y = x is a solution of 

(x 2 - X + l)±4- (x 2 + x)!j- + (x + 1 )y = 0, 
dx 2 dx 

find a linearly independent solution by reducing the order. Write the general 
solution. 


5. Given that y = e 2x is a solution of 


d 2 y 

] dP 


(2x + D^-4(x+l)^ + 4y = 0, 


find a linearly independent solution by reducing the order. Write the general 
solution. 


6. Given that y = x 2 is a solution of 
d 2 y 


dy 


(x 3 - x 2 ) — (x 3 + 2x 2 - 2x) + (2x 2 + 2x — 2)y = 0, 

dx dx 

find a linearly independent solution by reducing the order. Write the general 
solution. 

7. Prove Theorem 4.8 for the case n = 2. That is, prove that if u is any solution of 

d 2 y , x dy 

] d^ + ai[x) d^ 

and v is any solution of 

d 2 y , . dy 

) d^ + ai(x) d^ 

then u + v is also a solution of this latter nonhomogeneous equation. 


a o (*)t4 + a i + = 0 


fl 0 w t 4 + a i w T7 + a 2 (x)y = F(x), 
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8. Consider the nonhomogeneous differential equation 


d^y 

dx 2 


3 ^ + 2 

dx 


(a) Show that e x and e 2x are linearly independent solutions of the corresponding 
homogeneous equation 


d 2 y 

dx 2 



+ 2y = 0. 


(b) What is the complementary function of the given nonhomogeneous 
equation? 

(c) Show that lx 2 + 6x + 7 is a particular integral of the given equation. 

(d) What is the general solution of the given equation? 


9. Given that a particular integral of 


<Py_ 5 dy 
dx 2 dx 


+ 6y=l 


1 

,s y-e 


a particular integral of 

d 2 y 

dx 2 



is 


y 


x 5_ 
6 + 36’ 


and a particular integral of 


d 2 y 5 dy 
dx 2 dx 


+ 6y = e x 


is y = 


2 ’ 


use Theorem 4.10 to find a particular integral of 


, dy 
dx 2 dx 


+ 6y = 2 — \2x + 6e x . 


4.2 THE HOMOGENEOUS LINEAR EQUATION WITH CONSTANT COEFFICIENTS 
A. Introduction 


In this section we consider the special case of the nth-order homogeneous linear 
differential equation in which all of the coefficients are real constants. That is, we shall 
be concerned with the equation 


d n y 

a °d^ + ai 


d" 


dy 


dx ^ + ' + a "- 1 ^ + a " > '- 0 


(4.23) 


where a 0 , a ! , . . . , a n _ x , a n are real constants. We shall show that the general solution of 
this equation can be found explicitly. 

In an attempt to find solutions of a differential equation we would naturally inquire 
whether or not any familiar type of function might possibly have the properties that 
would enable it to be a solution. The differential equation (4.23) requires a function / 
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having the property such that if it and its various derivatives are each multiplied by 
certain constants, the a h and the resulting products, a i f (n ~ i) , are then added, the result 
will equal zero for all values of x for which this result is defined. For this to be the case 
we need a function such that its derivatives are constant multiples of itself. Do we know 
of functions / having this property that 

d^ 

dx k 


-J- k [/(*)] = cf(x) 


for all x? The answer is “yes,” for the exponential function / such that /(x) = e mx , where 
m is a constant, is such that 


d* 

dx 


k ( e mx ) = m k e mx . 


Thus we shall seek solutions of (4.23) of the form y = e mx , where the constant m will be 
chosen such that e mx does satisfy the equation. Assuming then that y = e mx is a solution 
for certain m, we have: 


dx 

d 2 y 

dx 2 


= me 


= rn 2 o mx 


d n y 

dx n 


= me 


Substituting in (4.23), we obtain 

a 0 rn n e mx + a l m n ~ i e mx + ••• + a n _ l me mx + a n e mx = 0 
or 

e mx (a 0 rn n + a l m n ~ l + • • • + a n _ t m 4- a n ) = 0. 

Since e mx # 0, we obtain the polynomial equation in the unknown m: 

a 0 m n + a l m n ~ l +‘*‘ + fl„_ 1 m + a„ = 0. (4.24) 

This equation is called the auxiliary equation or the characteristic equation of the given 
differential equation (4.23). If y = e mx is a solution of (4.23) then we see that the 
constant m must satisfy (4.24). Hence, to solve (4.23), we write the auxiliary equation 
(4.24) and solve it for m. Observe that (4.24) is formally obtained from (4.23) by merely 
replacing the /cth derivative in (4.23) by m k (k = 0, 1,2, ...,m). Three cases arise, 
according as the roots of (4.24) are real and distinct, real and repeated, or complex. 


B. Case 1 . Distinct Real Roots 

Suppose the roots of (4.24) are the n distinct real numbers 



4.2 THE HOMOGENEOUS LINEAR EQUATION WITH CONSTANT COEFFICIENTS 127 


Then 


e m\x e m 2 x e m n x 

are n distinct solutions of (4.23). Further, using the Wronskian determinant one may 
show that these n solutions are linearly independent. Thus we have the following result. 


THEOREM 4.1 1 

Consider the nth-order homogeneous linear differential equation (4.23) with constant 
coefficients . If the auxiliary equation (4.24) has the n distinct real roots m 1? m 2 , . . . , m n , 
then the general solution of (4.23) is 

y = c l e miX + c 2 e miX + ••• 4- c n e mnX , 

where c l ,c 2 ,...,c n are arbitrary constants. 


► Example 4.20 


Consider the differential equation 


The auxiliary equation is 


Hence 


m 2 — 3m -f 2 = 0. 


(m — 1 )(m — 2) = 0, = 1, m 2 = 2. 

The roots are real and distinct. Thus e x and e 2x are solutions and the general solution 
may be written 

y = c i e x + c 2 e 2x . 

We verify that e x and e 2x are indeed linearly independent. Their Wronskian is 

0 2x\ 


W(e x , e 2x ) = 


e e 
e x 2e 2x 


= e 3x / 0. 


Thus by Theorem 4.4 we are assured of their linear independence. 


► Example 4.21 


Consider the differential equation 


d 3 y d 2 y 
dx 3 dx 2 


+ ? + 6y-0. 

dx 


The auxiliary equation is 

m 3 — Am 2 -f m + 6 = 0. 
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We observe that m = — 1 is a root of this equation. By synthetic division we obtain the 
factorization 

(m + 1 )(m 2 — 5m + 6) = 0 
or 

(m -f 1 )(m — 2)(m — 3) = 0. 

Thus the roots are the distinct real numbers 

m t = — 1, m 2 = 2 , m 3 = 3, 

and the general solution is 

y = c l e~ x + c 2 e lx + c 3 e 3x . 


C. Case 2. Repeated Real Roots 

We shall begin our study of this case by considering a simple example. 


► Example 4.22: Introductory Example 


Consider the differential equation 


d 2 


dy 




(4.25) 


The auxiliary equation is 


m 2 — 6m + 9 = 0 


or 


(m - 3) 2 = 0. 


The roots of this equation are 


m { = 3, m 2 = 3 


(real but not distinct). 

Corresponding to the root we have the solution e 3x , and corresponding to m 2 we 
have the same solution e 3x . The linear combination c t e 3x + c 2 e 3x of these “two” so- 
lutions is clearly not the general solution of the differential equation (4.25), for it is not a 
linear combination of two linearly independent solutions. Indeed we may write the 
combination c t e 3x + c 2 e 3x as simply c 0 e 3x , where c 0 = c t + c 2 ; and clearly y = c 0 e 3x , 
involving one arbitrary constant, is not the general solution of the given second - order 
equation. 

We must find a linearly independent solution; but how shall we proceed to do so? 
Since we already know the one solution e 3x , we may apply Theorem 4.7 and reduce the 
order. We let 


3 v 

y = e v. 
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where v is to be determined. Then 


*L = e 

dx dx 


= - 3x — + 3e 3x t>, 


— = e 3x — + 6<? 3x — + 9e 3x v 
dx 2 dx 2 + dx + 


Substituting into Equation (4.25) we have 


. d 2 v 
dx 2 


o 3x 


dv 

dx 


,3x. 


6 e 


. dv 
dx 


+ 3e 3x t; + 9e 3x u = 0 


or 


~3x 


d 2 y 

dx 2 


Letting w = dv/dx, we have the first-order equation 

dx 


or simply 


The solutions of this first-order equation are simply w = c, where c is an arbitrary 
constant. Choosing the particular solution w = 1 and recalling that dv/dx = w, we find 

v(x) = x + c 0 , 

where c 0 is an arbitrary constant. By Theorem 4.7 we know that for any choice of the 
constant c 0 , v(x)e 3x = (x 4- c 0 )e 3x is a solution of the given second-order equation 
(4.25). Further, by Theorem 4.7, we know that this solution and the previously known 
solution e 3x are linearly independent. Choosing c 0 = 0 we obtain the solution 

3 x 

y = xe , 

and thus corresponding to the double root 3 we find the linearly independent solutions 

e 3x and xe 3x 


of Equation (4.25). 

Thus the general solution of Equation (4.25) may be written 

y = c l e 3x + c 2 xe 3x 
or 

y = (c l + c 2 x)e 3x . 


(4.26) 

(4.27) 


With this example as a guide, let us return to the general nth-order equation (4.23). If 
the auxiliary equation (4.24) has the double real root m, we would surely expect that e mx 
and xe mx would be the corresponding linearly independent solutions. This is indeed the 
case. Specifically, suppose the roots of (4.24) are the double real root m and the (n — 2) 
distinct real roots 
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Then linearly independent solutions of (4.23) are 

e mx xe *nx e mix ^ m 2 x e m n -2X 

and the general solution may be written 

y = c l e mx + c 2 xe mx + c 3 e miX -f c 4 e miX -f -f c n e mn ~ 2X 
or 

y = (c x + c 2 x)e mx + c 3 e miX -f c^e m2X -f ••• -f c n e mn ~ 2X . 

In like manner, if the auxiliary equation (4.24) has the triple real root m, 
corresponding linearly independent solutions are 

e mx , xe mx , and x 2 e mx . 

The corresponding part of the general solution may be written 

(c t + c 2 x -f c 3 x 2 )e mx . 

Proceeding further in like manner, we summarize Case 2 in the following theorem: 

THEOREM 4.1 2 

1. Consider the nth-order homogeneous linear differential equation (4.23) with 
constant coefficients. If the auxiliary equation (4.24) has the real root m occurring k times , 
then the part of the general solution of (4.23) corresponding to this k-fold repeated root is 

(Ci + c 2 x + c 3 x 2 + ••• + c k x k ~ l )e mx . 

2. If further, the remaining roots of the auxiliary equation (4.24) are the distinct real 
numbers m k+l ,...,m n , then the general solution of (4.23) is 

y = (c L + c 2 x + c 3 x 2 + ••• + c k x k ~ l )e mx + c k+l e mk + lX + ••• + c n e mnX . 

3. If however, any of the remaining roots are also repeated, then the parts of the 
general solution of (4.23) corresponding to each of these other repeated roots are 
expressions similar to that corresponding to m in part 1. 

We now consider several examples. 


► Example 4.23 


Find the general solution of 

d 3 y 


d 2 ' 


dx 3 ~ 4 : d^~ 3 ^ +m -°- 


The auxiliary equation 

m 3 - 4m 2 - 3m + 18 = 0 
has the roots, 3, 3, —2. The general solution is 

y = c l e 3x + c 2 xe 3x -f c 3 e~ 4 
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or 


y = (c t + c 2 x)e 3x 4- c 3 e 


► Example 4.24 

Find the general solution of 


<Py c d3 y , r d2 y , „ d y 

dx 

The auxiliary equation is 


^- 5 ^ + 6 s? +4 ^- 8 ^°- 


m 4 — 5m 3 4- 6 m 2 4- 4m — 8 = 0, 

with roots 2,2,2, — 1. The part of the general solution corresponding to the three-fold 
root 2 is 

y I = (c t + c 2 x + c 3 x 2 )e 2x 
and that corresponding to the simple root — 1 is simply 

y 2 = c 4 e" x . 

Thus the general solution is y = y i + y 2 , that is, 

y = (ci + c 2 x + c 3 x 2 )e 2x + c x e~ x . 


D. Case 3. Conjugate Complex Roots 

Now suppose that the auxiliary equation has the complex number a 4- bi (a, b real, i 2 = 
— 1, b / 0) as a nonrepeated root. Then, since the coefficients are real, the conjugate 
complex number a — bi is also a nonrepeated root. The corresponding part of the 
general solution is 

k i e ia+bi ^ x 4* k 2 e ia ~ bi)x , 

where k t and k 2 are arbitrary constants. The solutions defined by e (a + bi)x and e (a ~ bi)x 
are complex functions of the real variable x. It is desirable to replace these by two real 
linearly independent solutions. This can be accomplished by using Euler’s formula, 

e ie = cos 9 + i sin 9* 

which holds for all real 9. Using this we have: 

k ie {a+bi)x + k 2 e ia ~ bi)x = k Y e ax e bix + k 2 e ax e~ bix 
= e ax tk l e ibx + k 2 e~ ibx ~} 

= e ax [k t ( cos bx + i sin bx) + k 2 (cos bx — i sin fox)] 

= e ax l(k l -f k 2 )cos bx + i(k t — /c 2 )sin fox] 

= e ax [c t sin fox 4- c 2 cos fox], 


* We borrow this basic identity from complex variable theory, as well as the fact that e ax+b, *= e ax e ibx 
holds for complex exponents. 
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where c x = i(k t — k 2 ), c 2 = k t + k 2 are two new arbitrary constants. Thus the part of 
the general solution corresponding to the nonrepeated conjugate complex roots a + bi 
is 


e ax \_c l sin bx -F c 2 cos bx]. 

Combining this with the results of Case 2, we have the following theorem covering 
Case 3. 


THEOREM 4.13 

1. Consider the nth-order homogeneous linear differential equation (4.23) with 
constant coefficients. If the auxiliary equation (4.24) has the conjugate complex roots 
a -f bi and a — bi , neither repeated , then the corresponding part of the general solution of 
(4.23) may be written 

y = e ax (c t sin bx + c 2 cos bx). 

2. If however , a -f bi and a — bi are each k-fold roots of the auxiliary equation (4.24), 
then the corresponding part of the general solution of (4.23) may be written 

y = e ax [(c t + c 2 x + c 3 x 2 + • • • + c k x k ~ l ) sin bx 

+ (c*+i + c k+2 x + C k+ 3 x 2 + ■■■ + c 2k x k ~ l )cos bx']. 
We now give several examples. 


► Example 4.25 


Find the general solution of 




We have already used this equation to illustrate the theorems of Section 4. 1 . Let us now 
obtain its solution using Theorem 4.13. The auxiliary equation m 2 + 1=0 has the 
roots m = ±i. These are the pure imaginary complex numbers a ± bi, where a = 0, 
b = 1. The general solution is thus 

y = e 0x (c t sin 1 • x -f c 2 cos 1 * x), 

which is simply 

y = c t sin x + c 2 cos x. 


► Example 4.26 


Find the general solution of 


<Cy 6 dy 
dx 2 dx 


+ 25 y = 0. 
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The auxiliary equation is m 2 — 6m + 25 = 0. Solving it, we find 

6 + ^36 — 100 6 + 8 i 

m = — =^— = = 3 + 41 

2 2 

Here the roots are the conjugate complex numbers a + bi , where a = 3, b = 4. The 
general solution may be written 

y = e 3x (c l sin 4x + c 2 cos 4x). 


► Example 4.27 


Find the general solution of 


d 4 y d 3 y d 2 y 
— — — 4 — — + 14 — — 
dx 4 dx 3 dx 2 



+ 25y = 0. 


The auxiliary equation is 

m 4 — 4m 3 + 14m 2 — 20 m + 25 = 0. 


The solution of this equation presents some ingenuity and labor. Since our purpose in 
this example is not to display our mastery of the solution of algebraic equations but 
rather to illustrate the above principles of determining the general solution of dif- 
ferential equations, we unblushingly list the roots without further apologies. 

They are 

1 + 2 i, 1 - 2 i, 1 + 2 i, 1 - 21 

Since each pair of conjugate complex roots is double, the general solution is 
y = e x [(c l + c 2 x)sin 2x + (c 3 + c 4 x)cos 2x] 


or 


y = c l e x sin 2x + c 2 xe x sin 2x + c 3 e x cos 2x + c 4 xe x cos 2x. 


E. An Initial-Value Problem 

We now apply the results concerning the general solution of a homogeneous linear 
equation with constant coefficients to an initial-value problem involving such an 
equation. 


► Example 4.28 


Solve the initial-value problem 


d 2 y , dy 
dx 2 dx 


+ 25 y - 0, 


(4.28) 


y(0) = - 3, 
/( 0 ) = — 1 . 


(4.29) 

(4.30) 
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First let us note that by Theorem 4.1 this problem has a unique solution defined for all 
x, - oo < x < oo. We now proceed to find this solution; that is, we seek the particular 
solution of the differential equation (4.28) that satisfies the two initial conditions (4.29) 
and (4.30). We have already found the general solution of the differential equation 
(4.28) in Example 4.26. It is 

y = e 3x (c t sin 4x -F c 2 cos 4x). (4.31) 

From this, we find 

= e 3x [(3c l — 4c 2 )sin 4x -F (4^ + 3c 2 )cos 4x]. (4.32) 

ax 

We now apply the initial conditions. Applying condition (4.29), y(0) = — 3, to Equation 
(4.31), we find 

- 3 = e 0 (c t sin 0 + c 2 cos 0), 

which reduces at once to 


c 2 = -3. 

Applying condition (4.30), j/(0) = - 1, to Equation (4.32), we obtain 
- 1 = e°l(3c l - 4c 2 )sin 0 + (4 c t + 3c 2 )cos 0], 


which reduces to 


(4.33) 


4c l + 3c 2 = — 1. (4.34) 

Solving Equations (4.33) and (4.34) for the unknowns c t and c 2 , we find 

c i = 2 , c 2 = -3. 

Replacing c r and c 2 in Equation (4.31) by these values, we obtain the unique solution of 
the given initial-value problem in the form 

y = e 3x (2 sin 4x — 3 cos 4x). 


Recall from trigonometry that a linear combination of a sine term and a cosine term 
having a common argument cx may be expressed as an appropriate constant multiple 
of the sine of the sum of this common argument cx and an appropriate constant angle 
0. Thus the preceding solution can be reexpressed in an alternative form involving the 
factor sin(4x + 0) for some suitable 0. To do this we first multiply and divide by 
y/(2 2 ) + (- 3) 2 = yi3, thereby obtaining 

r 2 _ 3 

—= sin 4x cos 4x . 

_yi3 yn j 

From this we may express the solution in the alternative form 

y = ^l3c 3x sin(4x + 0), 
where the angle 0 is defined by the equations 


y = yi3e 3 1 


sin cp — 


3 

7*3’ 


COS 4 > = 


2 

yiy 
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Exercises 

Find the general solution of each of the differential equations in Exercises 1-24. 

« d 2 y dy r n J2 

L d^~ 5 d^ + 6y ~°- 


5. 


7. 


9. 


11 . 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20 . 


21 . 


22 . 


23. 


d 2 y „ dy 


3. 4^-1 2^ + 5, -». 
ax dx 


d 3 y 
dx 3 




- 3 i #-£ +3 '- 0 


g-*£ 4 >*-* 


d 2 )* 

dx 2 


■4^+ 13y = 0. 
dx 


3y = 0. 


d 2 v 

dx 3 dx 2 dx 

d 3 y 


dx 3 

d 3 y 

dx 3 




+ 4 


dy 


dx 1 


+ 5 / + 6y = 0. 
dx 


d 3 y d 2 y | dy = 
dx 3 dx 2 dx y 

d 4 y d 2 y 

+ 8 t 4+ 16y = 0. 

ax^ 

^ + ^Z_o 

dx 4 + dx 3 


dx 4 

d 5 y 

dx 5 

d 4 y 

dx 4 

d 4 y 

dx 4 


- 2 


d 3 y . d 2 y dy 

^- 3 ^ + i +2j - ft 


dx 3 

d^y 

dx 3 


6 . 


10 . 


12 . 


• 2 -4- + 2 — + 12y = 0. 
dx dx 




3 ^- 14 ^ 
dx 2 dx 


■ 5y = 0. 


d 3 y 

dx 3 


d 2 y 

* dx 


_ 6 ~ y , c 


■y + 5 -y— + 12y — 0. 
dx 


d 2 y 


4 ^ +4 £ +,,=o ' 


d 2 


4 0 4 -° 


£ 4 ^ 4 ' 5 £ +2 “£ + ' 2 -°' 

d*y 


24. 


dx 


4-o. 
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Solve the initial-value problems in Exercises 25-42: 


25. 


40) = 3, 

4(0) = 5. 

26. 


y(0) = -4, 

4(0) = 2. 

27. 


40) = 1, 

4(0) = 6. 

28. 


40) = 2, 

4(0) = -4. 

29. 


40) = 2, 

4(0)= -3. 

30. 

4 &- i2 £ + ^-°- 

y(0) = 4, 

4(0) = 9. 

31. 

S+ 4 f+ 4 -* 

40) = 3, 

4(0) = 7. 

32. 


40) = 3, 

4(0)= -1. 

33. 

0- 4 s +2 ^-°- 

40) = o, 

4(0) = 5. 

34. 

&*4**'-* 

40)= -l, 

4(0) = 5. 

35. 

^+4 +i3 ^°- 

ax ax 

40) = 3, 

4(0)= -1. 

36. 

d 2 y ,dy 
— + 2-+5y~0, 

40) = 2, 

4(0) = 6- 

37. 

9 4 + 4 + ^-°- 

ax ax 

>-(0) = 6, 

4(0) = o. 

38. 

4 ft + 4 ^ + 37 y = °, 

ax ax 

40) = 2, 

4(0) = - 4. 

39. 

S- 6 S +, 4- 63=o - f(o)=o - /,o) -°' 3 ' ,o)=2 

40. 


= 0, 40) 

= 2, 4(0) = o, 4(0) = o. 

41. 

0- 3 0 44 -»- 

40) = i, 

4(0) = — 8, 4(0)= -4. 

42. 


= 0, >-(0) 

= o, 4(0) = 1, 4(0) = 6. 
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43. The roots of the auxiliary equation, corresponding to a certain lOth-order 
homogeneous linear differential equation with constant coefficients, are 

4, 4, 4, 4, 2 -b 3i, 2 — 3 i, 2 -b 3 i, 2 — 3i, 2 -b 3 i, 2 — 3 i. 

Write the general solution. 

44. The roots of the auxiliary equation, corresponding to a certain 12th-order 
homogeneous linear differential equation with constant coefficients, are 

2, 2, 2, 2, 2, 2, 3 ~b 4 i, 3 — 4 i, 3 -b 4i, 3 — 4f, 3 -b 4 i, 3 — 4i. 

Write the general solution. 

45. Given that sin x is a solution of 


d 4 y 


^ + 2 ^ +6 ^ + 2 ^ + 5y ~ 0 ’ 


d 2 y 


dy_ 

dx 


find the general solution. 

46. Given that e x sin 2x is a solution of 

d 4 y , - d 3 y d 2 y 


dy 


^ + 3 ^ + ^ +t3 £ + 3 °r- 0 - 


find the general solution. 


4.3 THE METHOD OF UNDETERMINED COEFFICIENTS 


A. Introduction; An Illustrative Example 


We now consider the (nonhomogeneous) differential equation 


d n y d n ~ l 

a °d^ + ai d^ 


v dv 

— + * * * + - 1 — + a n y = F(x), 


(4.35) 


where the coefficients a 0 , a t , . . . , a n are constants but where the nonhomogeneous term 
F is (in general) a nonconstant function of x. Recall that the general solution of (4.35) 
may be written 

y = y c + 

where y c is the complementary function , that is, the general solution of the correspond- 
ing homogeneous equation (Equation (4.35) with F replaced by 0), and y p is a particular 
integral , that is, any solution of (4.35) containing no arbitrary constants. In Section 4.2 
we learned how to find the complementary function; now we consider methods of 
determining a particular integral. 

We consider first the method of undetermined coefficients . Mathematically speaking, 
the class of functions F to which this method applies is actually quite restricted; but this 
mathematically narrow class includes functions of frequent occurrence and consid- 
erable importance in various physical applications. And this method has one distinct 
advantage — when it does apply , it is relatively simple! 
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► Example 4.29: Introductory Example 

gf- 2 £-3,-2^ (4.36, 

We proceed to seek a particular solution y p ; but what type of function might be a 
possible candidate for such a particular solution? The differential equation (4.36) 
requires a solution which is such that its second derivative, minus twice its first 
derivative, minus three times the solution itself, add up to twice the exponential 
function e 4x . Since the derivatives of e 4x are constant multiples of e 4x , it seems 
reasonable that the desired particular solution might also be a constant multiple of e 4x . 
Thus we assume a particular solution of the form 

= Ae 4x , (4.37) 

where A is a constant (undetermined coefficient) to be determined such that (4.37) is a 
solution of (4.36). Differentiating (4.37), we obtain 

y' p = AAe 4x and y" p = 16/le 4x . 

Then substituting into (4.36), we obtain 

16/le 4x - 2(4Ae 4x ) - 3Ae 4x = 2e 4x 


or 


5Ae 4x = 2e 4x . (4.38) 

Since the solution (4.37) is to satisfy the differential equation identically for all x on 
some real interval, the relation (4.38) must be an identity for all such x and hence the 
coefficients of e 4x on both sides of (4.38) must be respectively equal. Equating these 
coefficients, we obtain the equation 

5 A = 2, 


from which we determine the previously undetermined coefficient 


A=l 


Substituting this back into (4.37), we obtain the particular solution 


y p = \e Ax . 


Now consider the differential equation 

d 2 y „ dy 


dx‘ 2 £- 3y = 2e ” 


(4.39) 


which is exactly the same as Equation (4.36) except that e 4x in the right member has 
been replaced by e 3x . Reasoning as in the case of differential equation (4.36), we would 
now assume a particular solution of the form 

y p = Ae 3x . 

Then differentiating (4.40), we obtain 

y' p = 3 Ae 3x and y" = ( )A^ 3x . 


(4.40) 
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Then substituting into (4.39), we obtain 

9Ae 3x - 2(3Ae 3x ) - 3(Ae 3x ) = 2e 3x 


or 


0 • Ae 3x = 2e 3x . 


or simply 

0 = 2e 3x , 

which does not hold for any real x. This impossible situation tells us that there is no 
particular solution of the assumed form (4.40). 

As noted, Equations (4.36) and (4.39) are almost the same, the only difference 
between them being the constant multiple of x in the exponents of their respective 
nonhomogeneous terms 2e 4x and 2e 3x . The equation (4.36) involving 2e 4x had a 
particular solution of the assumed form Ae 4x , whereas Equation (4.39) involving 2e 3x 
did not have one of the assumed form Ae 3x . What is the difference in these two so 
apparently similar cases? 

The answer to this is found by examining the solutions of the differential equation 


d 2 y 

dx 2 




(4.41) 


which is the homogeneous equation corresponding to both (4.36) and (4.39). The 
auxiliary equation is m 2 - 2m — 3 = 0 with roots 3 and - 1; and so 

<? 3x and e~ x 


are (linearly independent) solutions of (4.41). This suggests that the failure to obtain a 
solution of the form y p = Ae 3x for Equation (4.39) is due to the fact that the function e 3x 
in this assumed solution is a solution of the homogeneous equation (4.41) correspond- 
ing to (4.39); and this is indeed the case. For, since Ae 3x satisfies the homogeneous 
equation (4.41), it reduces the common left member 


d 2 y 2 dy 
dx 2 dx 


-3 y 


of both (4.41) and (4.39) to 0, not 2e 3x , which a particular solution of Equation (4.39) 
would have to do. 

Now that we have considered what caused the difficulty in attempting to obtain a 
particular solution of the form Ae 3x for (4.39), we naturally ask what form of solution 
should we seek? Recall that in the case of a double root m for an auxiliary equation, a 
solution linearly independent of the basic solution e mx was xe mx . While this in itself tells 
us nothing about the situation at hand, it might suggest that we seek a particular 
solution of (4.39) of the form 

y p = Axe 3x . (4.42) 

Differentiating (4.42), we obtain 

y' p = 3Axe 3x + Ae 3x , y" = 9Axe 3x + 6Ae 3x . 

a 

Then substituting into (4.39), we obtain 

(9Axe 3x + 6Ae 3x ) - 2(3 Axe 3x + Ae 3x ) - 3 Axe 3x = 2e 3x 
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or 

(9A — 6 A — 3A)xe 3x + 4Ae 3x = 2e 3x . 


or simply 


0xe 3x + 4Ae 3x = 2e 3x . 


(4.43) 


Since the (assumed) solution (4.42) is to satisfy the differential equation identically for 
all x on some real interval, the relation (4.43) must be an identity for all such x and hence 
the coefficients of e 3x on both sides of (4.43) must be respectively equal. Equating 
coefficients, we obtain the equation 

4A = 2, 


from which we determine the previously undetermined coefficient 

A = 2- 

Substituting this back into (4.42), we obtain the particular solution 

y p = \xe 3x . 


We summarize the results of this example. The differential equations 

£-4- 3,-2.- 
ax * ax 


and 




(4.36) 


(4.39) 


each have the same corresponding homogeneous equation 


d 2 y 2 dy 
dx 2 dx 


- 3y = 0. 


(4.41) 


This homogeneous equation has linearly independent solutions 

e 3x and e~ x , 

and so the complementary function of both (4.36) and (4.39) is 

y c = c l e 3x + c 2 e~ x . 

The right member 2e 4x of (4.36) is not a solution of the corresponding homogeneous 
equation (4.41), and the attempted particular solution 

= Ae 4x (4.37) 

suggested by this right member did indeed lead to a particular solution of this assumed 
form, namely, y p = je 4x . On the other hand, the right member 2e 3x of (4.39) is a 
solution of the corresponding homogeneous equation (4.41) [with c t = 2 and c 2 = 0], 
and the attempted particular solution 

= Ae 3x (4.40) 

suggested by this right member failed to lead to a particular solution of this form. 
However, in this case, the revised attempted particular solution, 

y p = Axe 3x , 


(4.42) 
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obtained from (4.40) by multiplying by x, led to a particular solution of this assumed 
form, namely, y p = \ xe 3x . 

The general solutions of (4.36) and (4.39) are, respectively, 

,, „ „3x i _ n -x i 2./i4x 

y — c i e -r c 2 e -r 


and 

y = c t e 3x + c 2 e~ x + \xe 3x . 

The preceding example illustrates a particular case of the method of undetermined 
coefficients. It suggests that in some cases the assumed particular solution y p 
corresponding to a nonhomogeneous term in the differential equation is of the same 
type as that nonhomogeneous term, whereas in other cases the assumed y p ought to be 
some sort of modification of that nonhomogeneous term. It turns out that this is 
essentially the case. We now proceed to present the method systematically. 


B. The Method 

We begin by introducing certain preliminary definitions. 


DEFINITION 

We shall call a function a UC function if it is either (7) a function defined by one of the 
following: 

(i) x", where n is a positive integer or zero , 

(ii) e ax , where a is a constant # 0, 

(iii) sin(fox + c), where b and c are constants, b # 0, 

(iv) cos (bx + c), where b and c are constants, b # 0, 

or (2) a function defined as a finite product of two or more functions of these four types. 


► Example 4.20 

Examples of UC functions of the four basic types (i), (ii), (iii), (iv) of the preceeding 
definition are those defined respectively by 

x 3 , e~ 2x , sin(3x/2), cos(2x + tc/4). 

Examples of UC functions defined as finite products of two or more of these four basic 
types are those defined respectively by 

x 2 e 3x , x cos 2x, e 5x sin 3x, 

sin 2x cos 3x, x 3 e 4x sin 5x. 

The method of undetermined coefficients applies when the nonhomogeneous 
function F in the differential equation is a finite linear combination of UC functions. 
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Observe that given a UC function /, each successive derivative of / is either itself a 
constant multiple of a UC function or else a linear combination of UC functions. 


DEFINITION 

Consider a UC function /. The set of functions consisting of f itself and all linearly 
independent UC functions of which the successive derivatives of f are either constant 
multiples or linear combinations will be called the UC set of f 


► Example 4.31 

The function / defined for all real x by /(x) = x 3 is a UC function. Computing 
derivatives of f we find 

/' (x) = 3x 2 , f" (x) = 6x, /'" (x) = 6 = 6 • 1, / ,B) (x) = 0 for n > 3. 

The linearly independent UC functions of which the successive derivatives of / are 
either constant multiples or linear combinations are those given by 

x 2 , x, 1. 

Thus the UC set of x 3 is the set S = {x 3 ,x 2 ,x, 1}. 


► Example 4.32 

The function / defined for all real x by /(x) = sin 2x is a UC function. Computing 
derivatives of f we find 

/' (x) = 2 cos 2x, /" (x) = - 4 sin 2x, 

The only linearly independent UC function of which the successive derivatives of / are 
constant multiples or linear combinations is that given by cos 2x. Thus the UC set of 
sin 2x is the set S = {sin 2x, cos 2x}. 

These and similar examples of the four basic types of UC functions lead to the results 
listed as numbers 1, 2, and 3 of Table 4.1. 


► Example 4.33 

The function / defined for all real x by /(x) = x 2 sin x is the product of the two UC 
functions defined by x 2 and sin x. Hence / is itself a UC function. Computing 
derivatives of f we find 

/' (x) = 2x sin x + x 2 cos x, 

f" (x) = 2 sin x + 4x cos x — x 2 sin x, 

/"' (x) = 6 cos x — 6x sin x — x 2 cos x, 

No “new” types of functions will occur from further differentiation. Each derivative of 
/ is a linear combination of certain of the six UC functions given by x 2 sin x, x 2 cosx, 
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TABLE 4.1 



U C function 

UC set 

1 

x" 

{x",x"-‘,x"- 2 ,...,x,l} 

2 

e ax 

{«“} 

3 

sin (bx + c) or 
cos (bx + c) 

(sin(hx + c), cos (fox + c)} 

4 

x n e ax 

{x n e ax , x"~ l e ax , x"“ V x , . . . , xe ax , e ax } 

5 

x" sin(bx + c) or 
x" cos (bx + c) 

{x n sin (bx + c), x n cos (bx + c), 

x n_1 sin(fox + c), x"" 1 cos (fox + c), 

. . . , x sin(fox + c), x cos (fox + c), 
sin(fex + c), cos (bx + c)} 

6 

e ax sin (bx + c) or 
e ax cos (bx + c) 

{« e ax sin (bx + c), e ax cos (fox + c)} 

7 

x n e ax sin(fcx + c) or 
x n e ax cos (bx + c) 

{x n e ax sin (bx + c), x n e ax cos (bx + c), 

x n ~ l e ax sin(bx + c), x n ~ l e ax cos(bx + c), 

xe ax sin (bx + c), xe ax cos (^x + c), 
e ax sin (bx + c), e ax cos (fox + c)} 


x sin x, x cos x, sin x, and cos x. Thus the set 

S = {x 2 sin x, x 2 cos x, x sin x, x cos x, sin x, cos x} 
is the UC set of x 2 sin x. Note carefully that x 2 , x, and 1 are not members of this UC set. 

Observe that the UC set of the product x 2 sin x is the set of all products obtained by 
multiplying the various members of the UC set {x 2 ,x, 1} of x 2 by the various members 
of the UC set {sin x, cos x} of sin x. This observation illustrates the general situation 
regarding the UC set of a UC function defined as a finite product of two or more UC 
functions of the four basic types. In particular, suppose h is a UC function defined as the 
product fg of two basic UC functions / and g. Then the UC set of the product function 
h is the set of all the products obtained by multiplying the various members of the UC 
set of / by the various members of the UC set of g. Results of this type are listed as 
numbers 4, 5, and 6 of Table 4. 1 and a specific illustration is presented in Example 4.34. 


► Example 4.34 

The function defined for all real x by /(x) = x 3 cos 2x is the product of the two UC 
functions defined by x 3 and cos 2x. Using the result stated in the preceding paragraph, 
the UC set of this product x 3 cos 2x is the set of all products obtained by multiplying 
the various members of the UC set of x 3 by the various members of the UC set of 
cos 2x. Using the definition of UC set or the appropriate numbers of Table 4.1, we find 
that the UC set of x 3 is 


{x 3 , X 2 , X, 1} 
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and that of cos 2x is 


{sin 2x, cos 2x}. 

Thus the UC set of the product x 3 cos 2x is the set of all products of each of x 3 , x 2 , x, 
and 1 by each of sin 2x and cos 2x, and so it is 

{x 3 sin 2x, x 3 cos 2x, x 2 sin 2x, x 2 cos 2x, x sin 2x, x cos 2x, sin 2x, cos 2x}. 

Observe that this can be found directly from T able 4. 1 , number 5, with n = 3, b = 2, and 
c = 0. 


We now outline the method of undetermined coefficients for finding a particular 
integral y p of 


d n y d n l y dy 

a °d^ + ai d^ + '" + a - i d^ +a " y = 


F(x), 


where F is a finite linear combination 


F = A l u l + A 2 u 2 + ••• + A m u m 

of UC functions u l ,u 2 ,...,u m , the A { being known constants. Assuming the comple- 
mentary function y c has already been obtained, we proceed as follows: 

1. For each of the UC functions 


^1 > • • • J ^ m 

of which F is a linear combination, form the corresponding UC set, thus obtaining the 
respective sets 

Si , S 2 , . . . , S m . 

2. Suppose that one of the UC sets so formed, say Sj, is identical with or completely 
included in another, say S k . In this case, we omit the (identical or smaller) set Sj from 
further consideration (retaining the set S k ). 

3. We now consider in turn each of the UC sets which still remain after Step 2. 
Suppose now that one of these UC sets, say S t , includes one or more members which are 
solutions of the corresponding homogeneous differential equation. If this is the case, 
we multiply each member of S, by the lowest positive integral power of x so that the 
resulting revised set will contain no members that are solutions of the corresponding 
homogeneous differential equation. We now replace S t by this revised set, so obtained. 
Note that here we consider one UC set at a time and perform the indicated multi- 
plication, if needed, only upon the members of the one UC set under consideration at 
the moment. 

4. In general there now remains: 

(i) certain of the original UC sets, which were neither omitted in Step 2 nor 
needed revision in Step 3, and 

(ii) certain revised sets resulting from the needed revision in Step 3. 

Now form a linear combination of all of the sets of these two categories, with unknown 
constant coefficients ( undetermined coefficients ). 

5. Determine these unknown coefficients by substituting the linear combination 
formed in Step 4 into the differential equation and demanding that it identically satisfy 
the differential equation (that is, that it be a particular solution). 
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This outline of procedure at once covers all of the various special cases to which the 
method of undetermined coefficients applies, thereby freeing one from the need of 
considering separately each of these special cases. 

Before going on to the illustrative examples of Part C following, let us look back and 
observe that we actually followed this procedure in solving the differential equations 

(4.36) and (4.39) of the Introductory Example 4.29. In each of those equations, the 
nonhomogeneous member consisted of a single term that was a constant multiple of a 
UC function; and in each case we followed the outline procedure step by step, as far as it 
applied. 

For the differential equation (4.36), the UC function involved was e 4x ; and we formed 
its UC set, which was simply {e 4x } (Step 1). Step 2 obviously did not apply. Nor did Step 
3, for as we noted later, e 4x was not a solution of the corresponding homogeneous 
equation (4.41). Thus we assumed y p = Ae 4x (Step 4) substituted in differential equation 

(4.36) , and found A and hence y p (Step 5). 

For the differential equation (4.39), the UC function involved was e 3x ; and we formed 
its UC set, which was simply {e 3x } (Step 1). Step 2 did not apply here either. But Step 3 
was very much needed, for e 3x was a solution of the corresponding homogeneous 
equation (4.41). Thus we applied Step 3 and multiplied e 3x in the UC set { e 3x } by x, 
obtaining the revised UC set {xe 3x }, whose single member was not a solution of (4.41). 
Thus we assumed y p = Axe 3x (Step 4), substituted in the differential equation (4.39), 
and found A and hence y p (Step 5). 

The outline generalizes what the procedure for the differential equation of 
Introductory Example 4.29 suggested. Equation (4.39) of that example has already 
brought out the necessity for the revision described in Step 3 when it applies. We give 
here a brief illustration involving this critical step. 


► Example 4.35 


Consider the two equations 


d^y , dy 
dx 2 dx 


+ 2y = x 2 e x 


(4.44) 


and 


The UC set of x 2 e x is 


d 2 y ^ dy 

dx 2 dx 


+ y = x 2 e x 


S = {x 2 e x , xe x , e x }. 


(4.45) 


The homogeneous equation corresponding to (4.44) has linearly independent 
solutions e x and e 2x , and so the complementary function of (4.44) is y c = c l e x + c 2 e 2x . 
Since member e x of UC set S is a solution of the homogeneous equation corresponding 
to (4.44), we multiply each member of UC set S by the lowest positive integral power of 
x so that the resulting revised set will contain no members that are solutions of the 
homogeneous equation corresponding to (4.44). This turns out to be x itself; for the 
revised set 

S' = {x 3 e x , x 2 e*, xe*} 


has no members that satisfy the homogeneous equation corresponding to (4.44). 
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The homogeneous equation corresponding to (4.45) has linearly independent 
solutions e x and xe x , and so the complementary function of (4.45) is y c = c i e x + c 2 xe x . 
Since the two members e x and xe x of UC set S are solutions of the homogeneous 
equation corresponding to (4.45), we must modify S here also. But now x itself will not 
do, for we would get S ' , which still contains xe x . Thus we must here multiply each 
member of S by x 2 to obtain the revised set 

S" = {xV,xV,xV}, 

which has no member that satisfies the homogeneous equation corresponding to (4.45). 


C. Examples 

A few illustrative examples, with reference to the above outline, should make the 
procedure clear. Our first example will be a simple one in which the situations of Steps 2 
and 3 do not occur. 


► Example 4.36 


d 2 y 

dx 2 


2 ^ - 3y = 2e x - 10 sin x. 
dx 


The corresponding homogeneous equation is 


d 2 y 

dx 2 




and the complementary function is 

= c t e 3x + c 2 e~ x . 

The nonhomogenous term is the linear combination 2e x — 10 sin x of the two UC 
functions given by e x and sin x. 


1. Form the UC set for each of these two functions. We find 

St = {«*}. 

S 2 = {sin x, cos x}. 

2. Note that neither of these sets is identical with nor included in the other; hence 
both are retained. 

3. Furthermore, by examining the complementary function, we see that none of 
the functions e x , sin x, cos x in either of these sets is a solution of the corresponding 
homogeneous equation. Hence neither set needs to be revised. 

4. Thus the original sets and S 2 remain intact in this problem, and we form the 
linear combination 


Ae x + B sin x + C cos x 

of the three elements e x , sin x, cos x of S t and S 2 , with the undetermined coefficients 
A, B, C. 
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5. We determine these unknown coefficients by substituting the linear combination 
formed in Step 4 into the differential equation and demanding that it satisfy the dif- 
ferential equation identically. That is, we take 

y p = Ae x + B sin x + C cos x 

as a particular solution. Then 

y' p = Ae x + B cos x — C sin x, 

y p = Ae x — B sin x — C cos x. 

Actually substituting, we find 

( Ae x — B sin x — C cos x) — 2 (Ae x + B cos x — C sin x) 

— 3(Ae x + B sin x + C cos x) = 2e x — 10 sin x 


or 


— 4Ae x + ( — 415 + 2C)sin x + (-4C — 2J3)cos x = 2e x — 10 sin x. 

Since the solution is to satisfy the differential equation identically for all x on some real 
interval, this relation must be an identity for all such x and hence the coefficients of like 
terms on both sides must be respectively equal. Equating coefficients of these like terms, 
we obtain the equations 

— 4A = 2, -4 B + 2C = - 10, -4C - 2B = 0. 

From these equations, we find that 

A=-i, B = 2, C = — 1, 
and hence we obtain the particular integral 

y p = —je x + 2 sin x — cos x. 

Thus the general solution of the differential equation under consideration is 
y = y c + y p = c l e 3x + c 2 e~ x — je x + 2 sin x — cos x. 


► Example 4.37 


fy 

dx 2 


dy 

dx 


3^ + 2 y = 2x 2 + e x + 2xe x + 4e 3x . 


The corresponding homogeneous equation is 

d 2 y _ dy 

i^~ 3 d^ + 2y ~° 


and the complementary function is 


y c = c t e x + c 2 e 2x . 


The nonhomogeneous term is the linear combination 

2x 2 + e x / + 2xe x + 4e 3x 
of the four UC functions given by x 2 , e x , xe x , and e 3x . 



148 EXPLICIT METHODS OF SOLVING HIGHER-ORDER LINEAR DIFFERENTIAL EQUATIONS 


1. Form the UC set for each of these functions. We have 

5 1 = f }> 

5 2 = {«*}, 

5 3 = {xe x , e *}, 

5 4 = {e 3 *}- 

2. We note that S 2 is completely included in S 3 , so S 2 is omitted from further 
consideration, leaving the three sets 

S t = {x 2 , x, 1} S 3 = {xe x , e x }, S 4 = {e 3x }. 

3. We now observe that S 3 = {xe x , e x } includes e x 9 which is included in the 
complementary function and so is a solution of the corresponding homogeneous 
differential equation. Thus we multiply each member of S 3 by x to obtain the revised 
family 

5 3 = {x V, xe x }, 

which contains no members that are solutions of the corresponding homogeneous 
equation. 

4. Thus there remain the original UC sets 

Si = {x 2 , X, 1} 
and 

5 4 = {e 3 *} 

and the revised set 

S' 3 = {x 2 e x , xe x }. 

These contain the six elements 

x 2 , x, 1, e* x , x 2 e x , xe x . 

We form the linear combination 

,4x 2 + J3x + C + De 3x + Ex 2 e x + Fxe x 

of these six elements. 

5. Thus we take as our particular solution, 

y p = Ax 2 + Bx + C + De 3x + Ex 2 e x + Fxe x . 

From this, we have 

y' p = 2 Ax + B + 3 De 3x + Ex 2 e x + 2 Exe x + Fxe x + Fe x , 

y" p = 2/1 + 9 De 3x + Ex 2 e x + 4 Exe x + 2 Ee x + Fxe x + 2 Fe x . 

We substitute y p ,y' p ,yp into the differential equation for y, dy/dx , d 2 y/dx 2 , re- 
spectively, to obtain: 

2A + 9 De 3x + Ex 2 e x + (4 E + F)xe x + (2 E + 2 F)e x 

- 3[2Tx + B + 3 De 3x + Ex 2 e x + (2 E + F)xe x + F^ x ] 

+ 2 (Ax 2 + J3x + C + De 3x + Ex 2 e x + Fxe x ) 

= 2x 2 + e x + 2x^ x + 4^ 3x , 
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or 

(2/4 -3 5 + 2 C) + (2B - 6A)x + 2 Ax 2 + 2De 3x + (-2 E)xe x + (2 E - F)e x 

= 2x 2 + e x + 2xe x + 4e 3x . 

Equating coefficients of like terms, we have: 

2A - 35 + 2C = 0, 

25 - 6/4 = 0, 

2/4 = 2, 

2D = 4, 

— 2E = 2, 

2E-F = 1. 

From this A = 1, 5 = 3, C = D = 2, E = — 1, F = -3, and so the particular in- 
tegral is 

y p = x 2 + 3x + | + 2e 3x — x 2 e* — 3xe x . 

The general solution is therefore 

y = y c + y p = c t e x + c 2 e 2x + x 2 + 3x + | + 2e 3x — x 2 e x — 2xe x . 


► Example 4.38 


d y d 2 y 2 . 

r + —^5- = 3x + 4 sin x — 2 cos x. 
dx 4 dx 2 


The corresponding homogeneous equation is 

d^y d 2 y _ 
dx 4 + dx 2 ’ 


and the complementary function is 

y c = + c 2 x + c 3 sin x + c 4 cos x. 

The nonhomogeneous term is the linear combination 

3x 2 + 4 sin x — 2 cos x 


of the three UC functions given by 

x 2 , sin x, and cos x. 

1. Form the UC set for each of these three functions. These sets are, respectively, 

S ( = {x 2 , X, 1}, 

5 2 = {sin x, cos x}, 

5 3 — {cos x, sin x}. 

2. Observe that S 2 an< 3 S 3 are identical and so we retain only one of them, leaving the 
two sets 


Si = {x 2 , x, 1}, S 2 = {sin x, cos x}. 
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3. Now observe that = {x 2 , x, 1} includes 1 and x, which, as the complementary 
function shows, are both solutions of the corresponding homogeneous differential 
equation. Thus we multiply each member of the set S t by x 2 to obtain the revised set 

S\ = {x 4 , x 3 , x 2 }, 

none of whose members are solutions of the homogeneous differential equation. We 
observe that multiplication by x instead of x 2 would not be sufficient, since the 
resulting set would be (x 3 , x 2 , x}, which still includes the homogeneous solution x. 
Turning to the set S 2 , observe that both of its members, sin x and cos x, are also 
solutions of the homogeneous differential equation. Hence we replace S 2 by the revised 
set 


S' 2 = {x sin x, x cos x}. 

4. None of the original UC sets remain here. They have been replaced by the revised 
sets S\ and S 2 containing the five elements 

x 4 , x 3 , x 2 , x sin x, x cos x. 

We form a linear combination of these, 

Ax 4 + Bx 3 + Cx 2 + Dx sin x + Ex cos x, 

with undetermined coefficients A, B, C, D, E. 

5. We now take this as our particular solution 

y p = Ax 4 + Bx 3 + Cx 2 + Dx sin x + Ex cos x. 

Then 

y' p = 4Ax 3 + 3Bx 2 + 2 Cx + Dx cos x + D sin x — Ex sin x + E cos x, 

y'p = \2Ax 2 + 6Bx + 2 C — Dx sin x + 2D cos x — Ex cos x — 2 E sin x, 

y'p = 24Ax + 6B — Dx cos x — 3D sin x + Ex sin x — 3 E cos x, 

y ( p v) = 24 A + Dx sin x - 4D cos x + Ex cos x + 4£ sin x. 

Substituting into the differential equation, we obtain 

24/1 + Dx sin x — 4D cos x + Ex cos x + 4E sin x + \2Ax 2 + 6Bx + 2 C 
— Dx sin x + 2D cos x — Ex cos x — 2 E sin x 

= 3x 2 + 4 sin x — 2 cos x. 


Equating coefficients, we find 

24A + 2C = 0 


6B = 0 


12/4 = 3 
-2D = -2 
2E = 4. 

Hence A = %, B = 0, C = — 3, D = 1, E = 2, and the particular integral is 
y p = 4 X 4 — 3x 2 + x sin x + 2x cos x. 
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The general solution is 

y = y c + y P 

= c l + c 2 x + c 3 sin x + c 4 cos x + ^x 4 — 3x 2 + x sin x + 2x cos x. 


► Example 4.39 An Initial-Value Problem 

We close this section by applying our results to the solution of the initial-value problem 

— 2 — 3y = 2e x — 10 sin x, (4.46) 

ax ax 

y( 0) = 2, (4.47) 

/( 0) = 4. (4.48) 

By Theorem 4.1, this problem has a unique solution, defined for all x, — oo < x < oo; 
let us proceed to find it. In Example 4.36 we found that the general solution of the 
differential equation (4.46) is 

y = c t e 3x + c 2 e~ x — \e x + 2 sin x — cos x. (4.49) 

From this, we have 

— 3c^ 3x — c 2 e~ x — je x + 2 cos x + sin x. (4.50) 

ax 

Applying the initial conditions (4.47) and (4.48) to Equations (4.49) and (4.50), 
respectively, we have 

2 = c l e° + c 2 e° — 2 e° + 2 sin 0 — cos 0, 

4 = 3 c t e° — c 2 e° — \e° + 2 cos 0 + sin 0. 

These equations simplify at once to the following: 

Ci + c 2 =?, 3 Ci -c 2 = |. 

From these two equations we obtain 

c i = 2 > c 2 = 2 . 

Substituting these values for c i and c 2 into Equation (4.49) we obtain the unique 
solution of the given initial-value problem in the form 

y = je 3x + 2e~ x — je x + 2 sin x — cos x. 


Exercises 


Find the general solution of each of the differential equations in Exercises 1-24. 
< d 2 y . dy _ . 7 

1 . -JL- } jL + ly , ix ,. 


2. ^-2%-%, = ^'- 21e- 

dx z dx 


3x 
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3. 

4. 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16 . 

17. 

18. 

19. 

20 . 


+ 2 + 5y = 6 sin 2x + 7 cos 2x. 

ax 1 ax 


+ 2 + 2y = 10 sin 4x. 

ax ax 


d 2 y ^dy 
v4 + 2 — + 4); = cos 4x. 
dx z dx 


( p^-3 ( ^-4y=16x- 12e 2x ‘ 
dx 1 dx 

+ 6^- + 5y = 2e x + 10e 5x 
dx 2 dx 


ft + 2 ^ + 10 , 

ax dx 


^-r + ^4 + 3 — 5y = 5 sin 2x + 10x 2 + 3x + 7. 

dx 3 dx 2 dx 


<Py_ dy 

dx 2 dx 


t + -r 1 - 6 V = 10e 2x - 18e 3x - 6x - 11. 

^ /TV 


44 + 4~ — 2y = 6e 2x + 3e x — 4x 2 . 
dx 3 4x 

rf 3 v d 2 v 

T 4-3 T 4 + 4y = 4e*- 18e“ x . 
dx * dx 1 

dx dx dx 

d 3 y dy 

—4 + — - = 2x 2 + 4 sin x. 
ax J dx 

d ^l_ 3 ^l + 2 ^l = 3e~* + 6e 2x - 6x 
dx 4 dx 3+ dx 2 + 

44 — 6 44 + 1 1 4~ ~ 6y = xe x - 4e 2x + 6e 4x . 

dx dx z dx 

T3 - 4 44 + 5^--2y = 3x 2 e x - le*. 
dx dx dx 
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Solve the initial-value problems in Exercises 25-40. 


25. 

d 2 y 

dx 2 

i 

x K 

+ 3y = 

= 9x 2 + 4, 

y(0) = 6, 

/(0) = 8. 

26. 

d 2 y 

dx 2 

+4 

+ 4 v = 

= 16x + 20e x , 

. >(0) = 0, 

/(0) = 3. 

27. 

d 2 y 

dx 2 

-4 

+ 15y 

= 9xe 2x , 

y(0) = 5, 

/(0) = 10. 

28. 

d 2 y 

dx 2 

+ 4 

+ 10y 

= 4xe _3x , 

y(0) = o, 

/(0) = -1. 

29. 

d 2 y 

dx 2 

+4 

+ 16 y 

= 8e _2x , 

y(0) = 2, 

/(0) = 0. 

30. 

d 2 y 

dx 2 

+4 

+ 9 y = 

= 21e~ 6x , 

y(0) = -2, 

/(0) = o. 

31. 

d 2 y 

dx 2 

+ 4 

+ 13 y 

= 1 8e “ 2x , 

y(0) = o, 

/(0) = 4. 

32. 

d 2 y 

dx 2 

- 10^ + 29y = 8e 5x , 
dx 

y( 0) = o, 

/(0) = 8. 

33. 

d 2 y 

dx 2 

-4 

+ 13 y 

= 8 sin 3x, 

y(0) = i. 

/(0) = 2. 

34. 

d 2 y 

dx 2 

_ _ 

dx 

- 6>' = 

8e 2x - 5e 3x , 

y( o) = 3, 

/(0) = 5. 

35. 

d 2 y 

dx 2 

— 2 — 

dx 

+ y = 

2xe 2x + 6e x , 

y(0) = i. 

/(0) = o. 

36. 

d 2 y 

dx 2 

- y = • 

3xV, 

y(0) = i. 

/(0) = 2. 


37. 

d 2 y 

dx 2 

+ y = : 

3x 2 — < 

\ sin x, j/(0) = 0, /(0) = 1. 

38. 

d 2 y 

dx 2 

+ 4y = 

= 8 sin 2x, y(0) = 

= 6, /( 0) = 

= 8. 
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39. 


d 3 y . d 2 y dy , , . _ 

t-t - 4 —4 + +- + 6y = 3xe x + 2e x - sin x, 
dx 4 dx 2 dx 


33 


^(°) = 40’ ^°) = 0 ’ /'(°) = °- 

40. ^_6^+9^- 4) ,-8*’ + 3-fe»', 

dx dx dx 

y( 0) = 1, /(0) = 7, y"(0) = 10. 

For each of the differential equations in Exercises 41-54 set up the correct linear 
combination of functions with undetermined literal coefficients to use in finding a 
particular integral by the method of undetermined coefficients. (Do not actually find 
the particular integrals.) 

d 2 y _ 6 <ll 

dx 2 dx 


41. 

42. 

43. 


2 - 6 17 . + = X 3 + x + e 2x 


d 2 y 

dx 


2 +9 y = e 3x + e 3x + e 3x sin 3x. 


*1 + 4*1 

dx 2 dx 


2 + + 5y = e 2x (l + cos x). 


44. — 6 + 9y = x 4 e x + x 3 e 2x + x 2 e 3x . 

dx 4 dx 


45. 

46. 

47. 


d 2 y , .dy 

dx 2 dx 


2 + 6 ~ + 1 3y = xe 3x sin 2x + x 2 e 2x sin 3x. 


_ 3 + 2 — = x 2 e x + 3xe 2x + 5x 2 

dx 3 dx 2 dx 

d3y 6 d2y + \2 dy 8ii — xp 2x + x 2 e 3x 

d^~ 6 d^ +n d^~ Sy ~ Xe +X£ ■ 


.o d 4 y . d 3 y . d 2 y „ dy 2 _ x . _ xl2 V3 

48. -p4 + 3 -t-j + 4 -j4 + 3 + y = x 2 c x + 3e x/2 cos +— x. 

dx 4 dx 3 ax 2 ax 2 


49. 

50. 


d 4 y 

dx 4 


- 16 y = x 2 sin 2x + x 4 e. 


d 6 y d 5 y d 4 y , 2 _ . 

—4 + 2 — +r + 5 -T-j- = x 3 + x 2 e x + e x sin 2x. 
ax° ax 3 ax 4 


4 4 y d 2 y 2 

5I - i? + 2 s? +,,=x cos * 

d 4 y 


52. + 16y = xe^ x sin ^/ix + e cos ^/2x. 


53. 

54. 


d 4 y ~ d 2 y 2 

——r + 3 —r — 4 v = cos x — cosh x. 

dx 4 dx 2 J 


dx 


. + 10 — + 9v = sin x sin 2x. 
* dx 



4.4 


VARIATION OF PARAMETERS 155 


4.4 VARIATION OF PARAMETERS 


A. The Method 

While the process of carrying out the method of undetermined coefficients is actu- 
ally quite straightforward (involving only techniques of college algebra and 
differentiation), the method applies in general to a rather small class of problems. For 
example, it would not apply to the apparently simply equation 

d 2 y 

i? + >= mx 

We thus seek a method of finding a particular integral that applies in all cases 
(including variable coefficients) in which the complementary function is known. Such a 
method is the method of variation of parameters , which we now consider. 

We shall develop this method in connection with the general second-order linear 
differential equation with variable coefficients 

*o(x) lx 1 + Ul ^ lx + a2 ^ y = F ^' ( 451 ) 

Suppose that y t and y 2 are linearly independent solutions of the corresponding 
homogeneous equation 

a ° ^ lx 1 + a 1 ^ Jx + ° 2 (*) y = °‘ ( 4 - 52 ) 

Then the complementary function of Equation (4.51) is 

Ciyi(x) + c 2 y 2 (x), 

where y t and y 2 are linearly independent solutions of (4.52) and c Y and c 2 are arbitrary 
constants. The procedure in the method of variation of parameters is to replace the 
arbitrary constants c L and c 2 in the complementary function by respective functions v t 
and v 2 which will be determined so that the resulting function, which is defined by 

Vt(x)y t (x) + v 2 (x)y 2 (x), (4.53) 

will be a particular integral of Equation (4.51) (hence the name, variation of 
parameters). 

We have at our disposal the two functions v t and v 2 with which to satisfy the one 
condition that (4.53) be a solution of (4.51). Since we have two functions but only one 
condition on them, we are thus free to impose a second condition, provided this second 
condition does not violate the first one. We shall see when and how to impose this 
additional condition as we proceed. 

We thus assume a solution of the form (4.53) and write 

y P (x) = Oi(x)y,(x) + v 2 (x)y 2 (x). (4.54) 

Differentiating (4.54), we have 

y'p(x) = t>, (x)y\ (x) + v 2 (x)y' 2 (x) + v\ (x)y t (x) + ;/ 2 (x)y 2 (x), (4.55) 

where we use primes to denote differentiations. At this point we impose the 
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aforementioned second condition; we simplify y' p by demanding that 

v\ Wj'i (x) + v' 2 (x)y 2 (x) = 0. (4.56) 

With this condition imposed, (4.55) reduces to 

/„(*) = 1>1 M/i (x) + V 2 (x)y 2 (x). (4.57) 

Now differentiating (4.57), we obtain 

y'p(x) = Vi (x)yi(x) + v 2 (x)y 2 (x) + r' 1 (x)y 1 (x) + v' 2 (x)y' 2 (x). (4.58) 

We now impose the basic condition that (4.54) be a solution of Equation (4.5 1). Thus we 
substitute (4.54), (4.57), and (4.58) for y, dy/dx, and d 2 y/dx 2 , respectively, in Equation 
(4.51) and obtain the identity 

«o(x)DMx)y'i'(x) + v 2 (x)y 2 (x) + i/,(x)/,(x) + v' 2 (x)y' 2 (x)] 

+ diW[hW/i(x) + u 2 (x)/ 2 (x)] + a 2 (x)[u 1 (x)y 1 (x) + r 2 (x)y 2 (x)] = F(x). 
This can be written as 


Ui(x)[a 0 (x)/ 1 '(x) + a l (x)y' l (x) + a 2 (x)y 1 (x)] 

+ i’ 2 (x)[a 0 (x)/ 2 '(x) + a l (x)y' 2 (x) + a 2 (x)y 2 (x)] 

+ aoMKM/iM + f’' 2 (x)y 2 (x)] = F(x). (4.59) 


Since y, and y 2 are solutions of the corresponding homogeneous differential equation 
(4.52), the expressions in the first two brackets in (4.59) are identically zero. This leaves 
merely 

F(x) 

v\ (x)yi (x) + v 2 (x)y' 2 (x) = (4.60) 

a 0 (x) 

This is actually what the basic condition demands. Thus the two imposed conditions 
require that the functions r, and v 2 be chosen such that the system of equations 

Fi (x)u'i(x) + y 2 (x)v’ 2 (x) = 0, 

F(x) (4-61) 

y\ (x)u'i (x) + y’ 2 (x)v' 2 (x) = — — , 

a 0 (x) 


is satisfied. The determinant of coefficients of this system is precisely 


y 2 (x)] 


J'l(x) y 2 (x) 

yi(x) y’i(x ) ' 


Since v i and y 2 are linearly independent solutions of the corresponding homogeneous 
differential equation (4.52), we know that W[y Y (x), y 2 (x)] # 0. Elence the system (4.61) 
has a unique solution. Actually solving this system, we obtain 


0 y 2 (x) 

F(x) , 

(x) = -- g (x) yiX = ^l yAx) 

J'i(x) y 2 (x) a 0 W^[j' 1 (x),y 2 (x)]’ 

y.(x) y 2 (x) 
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v' 2 (x) = 


J'iW 

/iW 


0 

F(x) 

a 0 (x) 


J'iW J' 2 W 
/iW y'i(x) 


Hxyy iW 

"oW^thW- hW]' 


Thus we obtain the functions tij and v 2 defined by 


»i(x) 


v 2 (x) 


f x F(t)y 2 (t) dt 

J a 0 (t)Wl yi (t),y2(t)V 

* F(t) yi (t)dt 

aoitWlyM y 2 (t)Y 


Therefore a particular integral y p of Equation (4.51) is defined by 

y P (x ) = thM-fiM + r 2 (x))i 2 (x), 
where t>, and v 2 are defined by (4.62). 


(4.62) 


B. Examples 


► Example 4.40 


Consider the differential equation 

d 2 y 

—r + y = tan x. 

ax 


(4.63) 


The complementary function is defined by 

y c (x) = c 2 sin x + c 2 cos x. 

We assume y p (x) = ^(xjsin x + r 2 (x)cos x, (4.64) 

where the functions and v 2 will be determined such that this is a particular integral of 

the differential equation (4.63). Then 


y' p (x) = u 2 (x)cos x — u 2 (x)sin x + t/ 2 (x)sin x + t/ 2 (x)cos x. 

We impose the condition 

ri(x)sin x + t/ 2 (x)cos x = 0, (4.65) 

leaving y' p (x) = f 1 (x)cos x - t; 2 (x)sin x. 

From this 

y p (x) = — y 1 (x)sin x — y 2 (x)cos x + t/^xjcos x — t/ 2 (x)sin x. (4.66) 

Substituting (4.64) and (4.66) into (4.63) we obtain 

v\ (x)cos x — i/ 2 (x)sin x = tan x. 


(4.67) 
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Thus we have the two equations (4.65) and (4.67) from which to determine v\ (x), i/ 2 (x): 

i;i(x)sin x -f i/ 2 (x)cos x = 0, 
v\(x)cos x — i/ 2 (x)sin x = tan x. 


Solving we find: 

»'iW 

v’ 2 {x) 


0 cos x 

tan x — sin x — cos x tan x 

— = = sin x, 

sin x cos x — 1 

cos x — sin x 

sin x 0 

cos x tan x sin x tan x — sin 2 x 
sin x cos x — 1 cos x 

cos x — sin x 

cos 2 x — 1 

= cos x — sec x. 

cos x 


Integrating we find: 

v t (x) = -cos x + c 3 , v 2 (x) = sin x — In |sec x + tan x| + c 4 . (4.68) 

Substituting (4.68) into (4.64) we have 

y p (x) = ( — cos x + c 3 )sin x + (sin x — In |sec x + tan x| + c 4 )cos x 
= -sin x cos x + c 3 sin x + sin x cos x 
— In | sec x + tan x| (cos x) + c 4 cos x 
= c 3 sin x -f c 4 cos x — (cos x)(ln |sec x + tan x|). 

Since a particular integral is a solution free of arbitrary constants, we may assign any 
particular values A and B to c 3 and c 4 , respectively, and the result will be the particular 
integral 

A sin x + B cos x — (cos x)(ln | sec x + tan x | ). 

Thus y = y c + y p becomes 

y = c l sin x + c 2 cos x -f A sin x + B cos x — (cos x)(ln |sec x -f tan x|), 
which we may write as 

y = C l sin x + C 2 cos x — (cos x)(ln |sec x + tan x|), 
where C i = c t + A, C 2 = c 2 + B. 

Thus we see that we might as well have chosen the constants c 3 and c 4 both equal to 0 
in (4.68), for essentially the same result, 

y = c t sin x + c 2 cos x — (cos x)(ln |sec x + tan x|), 

would have been obtained. This is the general solution of the differential equation 
(4.63). 


The method of variation of parameters extends to higher-order linear equations. We 
now illustrate the extension to a third-order equation in Example 4.41, although we 
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hasten to point out that the equation of this example can be solved more readily by the 
method of undetermined coefficients. 


► Example 4.41 


Consider the differential equation 


d 3 y d 2 y 
dx 3 dx 2 


+ 


dx 


The complementary function is 

y c (x) = c i e x + c 2 e lx -f c 3 e 3x . 


(4.69) 


We assume as a particular integral 

y p (x) = v t (x)e x + v 2 (x)e 2x + v 3 (x)e 3x . (4.70) 

Since we have three functions v t ,v 2 , v 3 at our disposal in this case, we can apply three 
conditions. We have: 

y' p (x) = v 1 (x)e x + 2v 2 (x)e 2x + 3v 3 (x)e 3x + v\(x)e x + v' 2 (x)e 2x + v' 3 (x)e 3x . 

Proceeding in a manner analogous to that of the second-order case, we impose the 
condition 


leaving 

Then 


v\(x)e x -f v 2 (x)e 2x + v' 3 (x)e 3x = 0, 
y' p (x) = v i (x)e x -f 2v 2 (x)e 2x -f 3v 3 (x)e 3x . 


(4.71) 

(4.72) 


y"(x) = v i (x)e x + 4v 2 (x)e 2x + 9t; 3 (x)e 3x + v\(x)e x -f 2v 2 (x)e 2x + 3v 3 (x)e 3x . 

We now impose the condition 

v'i(x)e x + 2v 2 (x)e 2x + 3v 3 (x)e 3x = 0, (4.73) 

leaving 


y'p(x) = v l (x)e x -f 4v 2 (x)e 2x + 9i> 3 (x)e 3x . (4.74) 


From this, 

y’p{x) = v l (x)e x + Sv 2 (x)e 2x -f 21v 3 (x)e 3x + v\(x)e x + 4v' 2 (x)e 2x + 9v' 3 (x)e 3x . 

(4.75) 


We substitute (4.70), (4.72), (4.74), and (4.75) into the differential equation (4.69), 
obtaining: 

v l (x)e x + 8i; 2 (x)e 2x + 27v 3 (x)e 3x + v\(x)e x + 4v' 2 (x)e 2x + 9v' 3 (x)e 3x 

— 6v l (x)e x — 24v 2 (x)e 2x — 54i; 3 (x)c 3x + \\v i (x)e x + 22i? 2 (x)e 2x + 33v 3 (x)e 3x 

— 6v l (x)e x — 6v 2 {x)e 2x — 6v 3 {x)e 3x = e x 


or 

v\(x)e x + 4v 2 (x)e 2x + 9v' 3 (x)e 3x = e x . (4.76) 
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Thus we have the three equations (4.71), (4.73), (4.76) from which to determine 
v\(x), v' 2 (x), v' 3 (x): 

t>i(x)e x + v' 2 (x)e 2x + v’ 3 (x)e 3x = 0, 
v' l (x)e x + 2v' 2 {x)e 2x + 3v' 3 (x)e 3x = 0, 
v\ {x)e x + 4v 2 (x)e 2x + 9v' 3 (x)e 3x = e x . 


Solving, we find 


t/,(x) = 


v' 2 (x) = 


0 

e 2x 

e 3x 

0 

2e 2x 

3e 3x 

e x 

4e 2x 

9e 3x 

e x 

e 2x 

e 3x 

e x 

2e 2x 

3e 3x 

e x 

4e 2x 

9e 3x 

X 

0 

e 3x 1 


„6x 


1 1 
2 3 


~6x 


o 

e x 


3e 3x 

9e 3x 


e 2x 

2e lx 

4e 2x 


e 3x 

3e 3x 

9e 3x 


— e 


5x 


1 

2 ’ 


2e 


6x 


= —e 


u' 3 W = 


e x e 2x 0 

e* 2e lx 0 

e x 4e 2x e x 

X 

* 

so 

1 

1 1 

1 2 

e x e 2x e 3x 

2e 6x 

e x 2e 2x 3e 3x 



e x 4e 2x 9e 3x 




i 

= 2. e 


2x 


We now integrate, choosing all the constants of integration to be zero (as the previous 
example showed was possible). We find: 


Thus 


M*) = l x > v i( x ) = e \ y 3 W = -U 2x • 


j; p (x) = jxe x + e x e 2x — \e 2x e 3x = jxe x + \e x . 
Thus the general solution of Equation (4.53) is 

y = y c + y P = c ie x + C 2 e 2x + c 3 e 3x + jxe x + \e x 


or 

y = c\e x + c 2 e lx + c 3 e 3x + jxe x , 

where c\ = c t + 


In Examples 4.40 and 4.41 the coefficients in the differential equation were constants. 
The general discussion at the beginning of this section shows that the method applies 
equally well to linear differential equations with variable coefficients, once the 
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complementary function y c is known. We now illustrate its application to such an 
equation in Example 4.42. 

► Example 4.42 

Consider the differential equation 

+ ip ~ 2x % + 2y = 6(x2 + 1)2 ' (477) 

In Example 4.16 we solved the corresponding homogeneous equation 

{x2 + i) d^~ 2x '£ + 2y = 0 - 

From the results of that example, we see that the complementary function of equation 
(4.77) is 

y c (x) = c t x + c 2 (x 2 - 1). 

To find a particular integral of Equation (4.77), we therefore let 

)v(x) = Mx)x + v 2 {x)(x 2 - 1). (4.78) 

Then 

y' p (x) = v t (x) ■ 1 + r 2 W ' 2x + (x)x + v' 2 (x)(x 2 - 1 ). 


We impose the condition 


t/^xjx + v' 2 (x)(x 2 - 1) = 0, 

(4.79) 

leaving 


y'p(x) = V t (x) ■ 1 + v 2 (x) ■ 2x. 

(4.80) 

From this, we find 


y"(x) = v\ (x) + 2v 2 (x) + v 2 (x) ■ 2x. 

(4.81) 


Substituting (4.78), (4.80), and (4.81) into (4.77) we obtain 

(x 2 + l)[r'i(x) + 2r 2 (x) + 2xr' 2 (x)] — 2x[r,(x) + 2xr 2 (x)] 

+ 2[r t (x)x + r 2 (x)(x 2 — 1)] = 6(x 2 + l) 2 


or 

(x 2 + l)[r',(x) + 2xr' 2 (x)] = 6(x 2 + l) 2 . (4.82) 

Thus we have the two equations (4.79) and (4.82) from which to determine v\ (x) and 
v' 2 (x); that is, p' t (x) and t; 2 (x) satisfy the system 

ri(x)x + y' 2 (x)[x 2 — 1] = 0, 

v\ (x) + t/ 2 (x)[2x] = 6(x 2 + 1). 
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Solving this system, we find 


6(x 

0 x 2 - 1 

2 + 1) 2x 

— 6(x 2 + l)(x 2 — 1) 


x x 2 — 1 

1 2x 


x 2 + 1 


— 6(x 2 - 1), 


v' 2 (x) 


x 0 
1 6(x 2 -f 1) 

x x 2 — 1 
1 2x 


6x(x 2 + 1) , 

— 2 — ; — = 6x. 
x 2 + 1 


Integrating, we obtain 

i?i(x) = — 2x 3 + 6x, v 2 (x) = 3x 2 , (4.83) 

where we have chosen both constants of integration to be zero. Substituting (4.83) into 
(4.78), we have 

y P ( x ) = ( — 2x 3 + 6x)x + 3x 2 (x 2 — 1) 

= x 4 + 3x 2 . 


Therefore the general solution of Equation (4.77) may be expressed in the form 


y = y c + y P 

= c t x + c 2 (x 2 — 1) + x 4 + 3x 2 . 


Exercises 


Find the general solution of each of the differential equations in Exercises 1-18. 
d 2 y 


1. 

3. 

5. 

6 . 

7. 

8 . 

9. 

10 . 


dx 


j + y = cot x. 


d 2 y 


dx 


j + y = sec x. 


d 2 y 

dx 


2 +4 y = sec 2 2x. 


d 2 y 

— y + v = tan x sec x. 
dx 2 


d 2 y 


iy_ 

dx 


, 9 +4 —j— + 5v = e 2x sec x. 
dx 2 


d 2 y dy 

^-2l x + 5,=e-^2, 


d 2 y .dy ^ e 


-3x 


d X ‘ + 6 i; +9> '- s 


<^y 2 dy 

dx 2 dx 


2 2 — + y = xe x In x (x > 0). 


_ d 2 y 2 

2. — y + y = tan 2 x. 
dx 2 


4. 


d 2 y 


dx ,+»-*c X. 
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1 1 d y 

11. -p-=- 4- v = sec x esc x. 
dx 

12. 4- y = tan 3 x. 


13 . 


dx 

d 2 y dy 


+ 3-^ + 2 ,j> = 


1 


dx 2 ' ~ dx 




1 + e* 

1 

1 +e 2x 


1 


i, 

15 . -j~2 + y = . 

ax 1 + sin x 


2 dy_ 
dx 2 dx 

dx 2 dx 

<Py 2 dy 

dx 2 dx 


16 . ~ — y — 2 y^- 4- 3; = e x sin 1 x. 


1-7 “ y . ^ u x . e 

17 - TT + 3 — + 2 )' = — 


18 . — 2 — 2yy + y = xlnx (x > 0). 


19. Find the general solution of 

x 2 ^-y - 6x ^ + 10 y = 3x 4 + 6x 3 , 
dx A dx 

given that y = x 2 and y = x 5 are linearly independent solutions of the 
corresponding homogeneous equation. 

20. Find the general solution of 

{x + l)2 ^~ 2(x + 1) ^ + 2y = l 

given that y = x 4- 1 and y = (x 4- l) 2 are linearly independent solutions of the 
corresponding homogeneous equation. 

21. Find the general solution of 

(x 2 + 2x) ^y - 2(x 4- 1) ^ + 2y = (x + 2) 2 , 

given that y = x 4-1 and y = x 2 are linearly independent solutions of the 
corresponding homogeneous equation. 

22. Find the general solution of 

X 2 ^-y -x(x + 2)y 4-(x + 2)y = x 3 , 
dx dx 

given that y = x and y = xe x are linearly independent solutions of the corre- 
sponding homogeneous equation. 
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23. Find the general solution of 

x(x — 2 — (x 2 — 2)^j- + 2(x - 1)}; = 3x 2 (x — 2) 2 e x , 
ax ax 

given that y = e x and y = x 2 are linearly independent solutions of the corre- 
sponding homogeneous equation. 

24. Find the general solution of 


(2x + l)(x + 1) + 2x ^ - 2y = (2x + l) 2 , 


given that y = x and y = (x + 1) 1 are linearly independent solutions of the 
corresponding homogeneous equation. 


25. Find the general solution of 
d 2 y 


(sin 2 x) -j-j 


2 sin xcosx$^ + (cos 2 x + l)v = sin 3 x, 
ax 


given that y = sin x and y = x sin x are linearly independent solutions of the 
corresponding homogeneous equation. 

26. Find the general solution by two methods: 


d 3 y 

dx 3 


d 2 y dy 2 

^-r x +iy - xe 


4.5 THE CAUCHY-EULER EQUATION 


A. The Equation and the Method of Solution 


In the preceding sections we have seen how to obtain the general solution of the nth- 
order linear differential equation with constant coefficients. We have seen that in such 
cases the form of the complementary function may be readily determined. The general 
nth-order linear equation with variable coefficients is quite a different matter, however, 
and only in certain special cases can the complementary function be obtained explicity 
in closed form. One special case of considerable practical importance for which it 
is fortunate that this can be done is the so-called Cauchy-Euler equation (or equi- 
dimensional equation). This is an equation of the form 


x -^l + a x n ~ l 
)X dx" +aiX 


+ •■■ + a „- l x-J^ + a „y - F(x), (4.84) 


where a 0 , a l9 ..., a n - u a n are constants. Note the characteristic feature of this 
equation: each term in the left member is a constant multiple of an expression of the 
form 
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How should one proceed to solve such an equation? About the only hopeful thought 
that comes to mind at this stage of our study is to attempt a transformation. But what 
transformation should we attempt and where will it lead us? While it is certainly 
worthwhile to stop for a moment and consider what sort of transformation we might 
use in solving a “new” type of equation when we first encounter it, it is certainly not 
worthwhile to spend a great deal of time looking for clever devices which mathema- 
ticians have known about for many years. The facts are stated in the following theorem. 


THEOREM 4.14 


The transformation x = e x reduces the equation 


d n y 


a ° x " i? + ai 1 + ' ' ' + a "- 1 x it. + a * y = F(x) 

to a linear differential equation with constant coefficients. 


d n x y 


dy 


(4.84) 


This is what we need! We shall prove this theorem for the case of the second- order 
Cauchy-Euler differential equation 


9 d 2 y dy . 

a 0 x TT + a i x T~ + a 2y = F ( x )- 


(4.85) 


The proof in the general wth-order case proceeds in a similar fashion. Letting x = e\ 
assuming x > 0, we have t = In x. Then 

dy dy dt 1 dy 
dx dt dx x dt 


and 


d 2 y 1 d f dy\ + dy d ( l \ _ 1 ( d 2 y dt\ 1 dy 


dx 2 x dx\dt 
1 (d 2 y 1 


dt dx\x ) x\dt 2 dx) x 2 dt 


l dy 1 / d 2 y dy 


x\dt 2 x ) x 2 dt 


dt 2 


dt y 


Thus 


dy dy 2 d 2 y d 2 y dy 


x an( j x 2 , 9 

dx dt dx 


dt 2 


dt ’ 


Substituting into Equation (4.85) we obtain 


d 2 y dy 


°\dt 2 


-f t ] + a 


dy 


+ a 2 y = F(e') 


or 


where 




(4.86) 


A 0 = a 0 , A l = a l -a 0 , A 2 = a 2 , G(t) = F(e‘). 
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This is a second-order linear differential equation with constant coefficients, which was 
what we wished to show. 

Remarks. 1. Note that the leading coefficient a 0 x n in Equation (4.84) is zero for 
x = 0. Thus the basic interval a < x <b, referred to in the general theorems of Sec- 
tion 4.1, does not include x = 0. 

2. Observe that in the above proof we assumed that x > 0. If x < 0, the substitution 
x = — e l is actually the correct one. Unless the contrary is explicitly stated, we shall 
assume x > 0 when finding the general solution of a Cauchy-Euler differential 
equation. 


B. Examples 


► Example 4.43 


d^y 

dx 2 


2x^ + 2j> = x 3 . 
dx 


Let x = e l . Then, assuming x > 0, we have t = In x, and 


dy dy dt 1 dy 
dx dt dx x dt 9 


d 2 y_\fd 2 ydt\ 1 dy _ 1 fd 2 y dy\ 
dx 2 x\dt 2 dx) x 2 dt x 2 \dt 2 dt)' 


Thus Equation (4.87) becomes 

d 2 y 


dy _ 2 dy 
dt dt 


+ 2y = e 3t 


(4.87) 


or 


d 2 y - dy 
dt 2 dt 


+ 2y = 


(4.88) 


The complementary function of this equation is y c = c Y e l + c 2 e 2t . We find a particular 
integral by the method of undetermined coefficients. We assume y p = Ae 3t . Then 
y' p = 2Ae 3 \ y p = 9Ae 3 \ and substituting into Equation (4.88) we obtain 

2Ae 3t = e 3t . 

Thus A =j and we have y p = \e 3t . The general solution of Equation (4.88) is then 

y = c l e t + c 2 e 2t + \e 3t . 

But we are not yet finished! We must return to the original independent variable x. 
Since e l = x, we find 

y = c i x + c 2 x 2 +yx 3 . 

This is the general solution of Equation (4.87). 

Remarks. 1. Note carefully that under the transformation x = e l the right member 
of (4.87), x 3 , transforms into e 3t . The student should be careful to transform both sides 
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of the equation if he intends to obtain a particular integral of the given equation by 
finding a particular integral of the transformed equation, as we have done here. 

2. We hasten to point out that the following alternative procedure may be used. 
After finding the complementary function of the transformed equation one can im- 
mediately write the complementary function of the original given equation and then 
proceed to obtain a particular integral of the original equation by variation of param- 
eters. In Example 4.43, upon finding the complementary function c l e t + c 2 e 2t of 
Equation (4.88), one can immediately write the complementary function c l x + c 2 x 2 of 
Equation (4.87), then assume the particular integral y p (x) = ^(x)* + v 2 (x)x 2 , and 
from here proceed by the method of variation of parameters. However, when the 
nonhomogeneous function F transforms into a linear combination of UC functions, as 
it does in this example, the procedure illustrated is generally simpler. 

► Example 4.44 


.3 * 3 y d 2 y 


dy 


x 'dP~ 4 x i? + * x -ix 


- 8y = 4 In x. 


Assuming x > 0, we let x = e t . Then t = In x, and 

dy 1 dy 


dx 

d 2 y 

dx 2 


x dt ’ 

: _L (£y 

x 2 \dt 2 


dy 

dt 


d 3 y 

Now we must consider — - T . 

dx 6 


d 3 y 
dx 3 


x 2 dx l dt 2 


dy 

dt 


2 ( d 2 y dy 
x 3 l dt 2 dt 


1 fd 3 y dt 

d 2 y dt\ 

2 fd 2 y 

dy 

x 2 l dt 3 dx 

dt 2 dx ) 

x 3 \dt 2 ' 

' dt 


1 fd 3 y _ d 2 ^ 
x 3 l dt 3 dt 2 


d 2 y 

IF 


dy 

dt 


1 


d ll_F_ll + 7 d l 

dt 3 dt 2 dt 


Thus, substituting into Equation (4.89), we obtain 


d 3 y , d 2 y _ .dy 


dt 2 


— 3 -TT + 2 “T ) — 4 -~Y 


dt 


dt 


(4.89) 


d 2 y dy\ ( dy\ 

-JT--T + 8 hr -8y = 4t 
dt 2 dt J \dt) 


or 


d 3 y 

dt 3 


7^ + 14 ^-8, = 4, 
dt 2 dt 


The complementary function of the transformed equation (4.90) is 

y c = c l e t + c 2 e 21 + c 3 e 4 '. 


(4.90) 
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We proceed to obtain a particular integral of Equation (4.90) by the method of un- 
determined coefficients. We assume y p = At + B. Then y' p = A, y p = y' p = 0. Substi- 
tuting into Equation (4.90), we find 

14/4 - SAt - 8J 3 = At. 


Thus 


-8/4 = 4, 14/4-8J3 = 0, 

and so A = — B = — f . Thus the general solution of Equation (4.90) is 
y = + c 2 e 2 ' + c 3 e 4 ' -it 

and so the general solution of Equation (4.89) is 

y = + c 2 x 2 + C 3 x 4 - i In X - f . 

Remarks. In solving the Cauchy-Euler equations of the preceding examples, we 
observe that the transformation x = e‘ reduces 


and 



dy 
dt ’ 




dy 

dt’ 



to 


d^y ,d 2 y | dy 
dt 3 dt 2+ dt' 


We now show (without proof) how to find the expression into which the general term 

„ d"y 


dx n 


where n is an arbitrary positive integer, reduces under the transformation x = e l . We 
present this as the following formal four-step procedure. 


1. For the given positive integer n , determine 

r(r — l)(r — 2) • • • [r — (n — 1)]. 

2. Expand the preceding as a polynomial of degree n in r. 

d k y 

3. Replace r k by -r-r-, for each k = 1, 2, 3, . . . , n. 

at 

d n y 

4. Equate x n to the result in Step 3. 

For example, when n = 3, we have the following illustration. 

1. Since n = 3, n — 1 = 2 and we determine r(r — l)(r — 2). 

2. Expanding the preceding, we obtain r 3 - 3r 2 + 2r. 

d 3 y . d 2 y dy 

Replacing r 3 by r, r 2 by — and r by — , we have 

dt 2 dt 

dt 3 dt 2 + dt- 


3 . 
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4. Equating x 


<Py 

dx 2 


to this, we have the relation 


3 d*y_#y_ 3 d*y 2 dy 

dx 3 dt 3 dt 2 + dt' 

Note that this is precisely the relation we found in Example 4.44 and stated above. 


Exercises 

Find the general solution of each of the differential equations in Exercises 1-19. In each 
case assume x > 0. 


1. x 


2 d^l 

dx 2 


3x + 3y = 0. 
dx 


2. x 




dx 


, d 2 y 
dx 2 


3. 4x 2 ~t~t — 4x + 3y = 0. 


dy_ 

dx 


2 d 2 y , .. dy 


5. x 2 - r ^ + x-± + 4y = 0. 
dx L dx 


,2 Py 


7. 3x 2 -rAr - 4x + + 2y = 0. 
dx 


dx 


4, ** £ 4 -3,4 + 4,-0. 


dx 


6 . 


rf 2 }' , 


- 3x-p-+ 13}; = 0. 


dx 2 " v dx 


8 . ^ + *^ + 9) ,_ 0 . 


dx 2 


dx 


, d 2 y 

dx 2 


9. 9x 2 ^- + 3x^ + y = 0. 


dy 

dx 


10. x 2 - 5x ^ + 10}; = 0. 


dx 2 


dx 


,3 d3 y ^.2 dl y , 


11. x 3 — — t — 3x 2 — — t "h 6x 


dx 


dx 2 
2 , 


12. x 3 ^-4 + 2x 2 ^-4 - 10x -y- - 8y = 0. 
dx dx 2 dx 


dx 

dy 


6y = 0. 


13 x 3^!Z_ x 2^Z 

dx 3 dx 2 


6x^-+ 18}; = 0. 
dx 


d 2 y 

dx 2 


iy_ 

dx 


14. x 2 — 4x + 6y = 4x — 6. 


15. x 


2 d 2 y c dy a 3 

2 ^-4 - 5x-/ + 8y = 2x 3 . 
ax dx 


16. x 2 + 4x ^ + 2y = 4 In x. 


dy_ 

dx 


17. x 2 + x ^ + 4y = 2x In x. 
dx 1 dx 
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18. x 2 ^-^- + x^- + y = 4 sin In x. 
ax ax 


19. 


d 3 y 

dx 3 


d 2 y , . dy 


dx 2 


+ 2x^-2y = x 3 . 
dx 


Solve the initial- value problem in each of Exercises 20-27. In each case assume x > 0. 
2 °. x 2 0-2x^- I0y = 0, y( 1) = 5, /( 1) = 4. 

21. x 2 ^-4x^ + 6y = 0, y(2) = 0, y'( 2) = 4. 

dx dx 

22. x 2 ^ + 5x^+3y = 0, j*l) = 1, /(l)=-5. 

23. x 2 0-2y = 4x-8, y(l) = 4, /(1)=-1. 

24. x 2 - 4x ^ 4- 4y = 4x 2 - 6x 3 , y(2) = 4, /(2) = — 1. 

dx A dx 

25. x 2 0 + 2x£-6j;= 10x 2 , 3*1) =1, /(l)=-6. 

26. x 2 p^-5x^ + 8y = 2x 3 , y(2) = 0, /(2) = - 8. 

dx dx 

27. x2 ~cb? ~ 6y = ln x ’ 3*1) = 6> /(l)=-i 

28. Solve: 

{x + 2)2 ^~ ix + 2) ^~ 3y = 0 - 

29. Solve: 

(2x - 3) 2 ^| - 6(2x -3)^-+l2y = 0. 
dx dx 


4.6 STATEMENTS AND PROOFS OF THEOREMS ON THE SECOND-ORDER 
HOMOGENEOUS LINEAR EQUATION 

Having considered the most fundamental methods of solving higher-order linear 
differential equations, we now return briefly to the theoretical side of the subject and 
present detailed statements and proofs of the basic theorems concerning the second- 
order homogeneous equation. The corresponding results for both the general nth- 
order equation and the special second-order equation were introduced in Section 4. IB 
and employed frequently thereafter. By restricting attention here to the second-order 
case we shall be able to present proofs which are completely explicit in every detail. 
However, we point out that each of these proofs may be extended in a straight-forward 
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manner to provide a proof of the corresponding theorem for the general nth-order case. 
For general proofs, we again refer to Chapter 11. 

We thus consider the second-order homogeneous linear differential equation 

+ ai ^5x + a2 ^ y = °' ( 4 - 91 ) 

where a 0 , a l9 and a 2 are continuous real functions on a real interval a < x <b and 
a 0 (x) # 0 for any x on a < x < b. 

In order to obtain the basic results concerning this equation, we shall need to make 
use of the following special case of Theorem 4.1 and its corollary. 


THEOREM A 

Hypothesis. Consider the second-order homogeneous linear equation (4.91), where 
a 0 , a l5 and a 2 are continuous real functions on a real interval a < x < b and a 0 (x) # 0 
for any x on a < x < b. Let x 0 be any point of a < x < b; and let c 0 and c t be any two 
real constants. 

Conclusion 1. Then there exists a unique solution f of Equation (4.91) such that 
f(x 0 ) = c 0 and f'(x 0 ) = c l , and this solution f is defined over the entire interval 
a < x < b. 

Conclusion 2. In particular , the unique solution f of Equation (4.91), which is such 
that f(x 0 ) = 0 and f'(x 0 ) = 0, is the function f such that f(x) = 0 for all x on a < x < b. 

Besides this result, we shall also need the following two theorems from algebra. 


THEOREM B 

Two homogeneous linear algebraic equations in two unknowns have a nontrivial solution 
if and only if the determinant of coefficients of the system is equal to zero. 


THEOREM C 

Two linear algebraic equations in two unknowns have a unique solution if and only if the 
determinant of coefficients of the system is unequal to zero. 

We shall now proceed to obtain the basic results concerning Equation (4.91). Since 
each of the concepts involved has already been introduced and illustrated in Sec- 
tion 4.1, we shall state and prove the various theorems without further comments or 
examples. 


THEOREM 4.15 

Hypothesis. Let the functions f t and f 2 be any two solutions of the homogeneous 
linear differential equation (4.91) on a < x < b, and let c t and c 2 be any two arbitrary 
constants. 
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Conclusion. Then the linear combination c l f l + c 2 f 2 of f i and f 2 is also a solution 
of Equation {4.91) on a < x < b. 

Proof. We must show that the function / defined by 


f{x) = c l fi{x) + c 2 f 2 (x). 

a < x < b, 

(4.92) 

satisfies the differential equation (4.91) on a < x < b. From (4.92), we see that 


fix) = cj\(x ) + C2/2W, 

a < x < b, 

(4.93) 

and 



fix) = cj'i(x) + c 2 f 2 {x). 

a < x < b. 

(4.94) 


Substituting f(x) given by (4.92), f'(x) given by (4.93), and f"{x) given by (4.94) for y, 
dy/dx , and d 2 y/dx 2 , respectively, in the left member of differential equation (4.91), we 
obtain 


«oW[ c i/'i'W + C2/2W] + fliWEci/'iW + C2/2W] 

+ a 2 (x)[c,/i(x) + c 2 / 2 (x)]. (4.95) 

By rearranging terms, we express this as 

Cila 0 (x)f';(x) + afx)f\{x) + « 2 W/iW] 

+ c 2 la 0 (x)f'i(x) + a i (x)f , 2 (x) + a 2 (x)/ 2 (x)]. (4.96) 

Since by hypothesis, f x and f 2 are solutions of differential equation (4.91) on a < x < 
b , we have, respectively, 

«oW/TW + fliW/iM + « 2 W/iW = 0 
and 

floM/ 2 'M + a x (x)f 2 (x) + a 2 (x)f 2 (x) = 0 
for all x on a < x < b. 

Thus the expression (4.96) is equal to zero for all x on a < x < b, and therefore so is 
the expression (4.95). That is, we have 

+ c 2 f 2 (xy] + a 1 (x)[c 1 f\(x) + c 2 / 2 (x)] 

+ a 2 (x)[ci/ 1 (x) + c 2 / 2 (x)] = 0 

for all x on a < x < b, and so the function c x f x + c 2 f 2 is also a solution of differential 
equation (4.91) on this interval. Q.E.D. 


THEOREM 4.1 6 

Hypothesis. Consider the second-order homogeneous linear differential equation 
{4.91), where a 0 ,a x , and a 2 are continuous on a < x < b and a 0 (x) # 0 on a < x < b. 

Conclusion. There exists a set of two solutions of Equation {4.91) that are linearly 
independent on a < x < b. 
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Proof. We prove this theorem by actually exhibiting such a set of solutions. Let x 0 
be a point of the interval a < x <b. Then by Theorem A, Conclusion 1, there exists a 
unique solution f Y of Equation (4.91) such that 

fi(x 0 ) = 1 and f\(x o ) = 0 (4.97) 

and a unique solution f 2 of Equation (4.91) such that 

fi(x o) = 0 and f' 2 (x 0 ) = 1. (4.98) 

We now show that these two solutions f l and f 2 are indeed linearly independent. 
Suppose they were not. Then they would be linear dependent ; and so by the definition of 
linear dependence, there would exist constants c t and c 2 , not both zero, such that 

Cifi(x) + C 2 f 2 (x) = 0 (4.99) 

for all x such that a < x < b. Then also 

Cif'lix) 4- c 2 f 2 (x) = 0 (4.1 00) 

for all x such that a < x < b. The identities (4.99) and (4.100) hold at x = x 0 , giving 


c x/i(x 0 ) + c 2 / 2 (x 0 ) = 0, cj\(x q) + c 2 / 2 (x 0 ) = 0. 

Now apply conditions (4.97) and (4.98) to this set of equations. They reduce to 
c,( 1) + c 2 (0) = 0, c,(0) + c 2 (l) = 0. 

or simply c t = c 2 = 0, which is a contradiction (since and c 2 are not both zero). Thus 
the solutions /, and f 2 defined respectively by (4.97) and (4.98) are linearly independent 
on a < x < b. Q.E.D. 


THEOREM 4.17 

Two solutions j\ and f 2 of the second-order homogeneous linear differential equation 
(4.91) are linear independent on a < x <bif and only if the value of theWronskianof fi 
and f 2 is different from zero for some x on the interval a < x < b. 

Method of Proof. We prove this theorem by proving the following equivalent 
theorem. 


THEOREM 4.18 


Two solutions j\ and f 2 of the second-order homogeneous linear differential equation 
(4.91) are linearly dependent on a < x < bif and only if the value of the Wronskianof f t 
and f 2 is zero for all x on a < x < b: 


/i(x) f 2 (x) 
f i(x) f' 2 (x) 


for all x on a < x < b. 


Proof. Part 1. We must show that if the value of the Wronskian of f t and f 2 is zero 
for all x on a < x < b, then /, and f 2 are linearly dependent on a < x < b. We thus 
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assume that 


AM fi(x) 

/iM /'2W 


for all x such that a < x < b. Then at any particular x 0 such that a < x 0 < b, we have 

= 0 . 

Thus, by Theorem B, there exist constants c t and c 2 , not both zero, such that 


/i(*o) fi (*^o) 
/' l(*o) fWo) 


Cif l {x o )+c 2 f 2 (x o ) = 0, 
Clf'l(Xo) + C 2 f' 2 (x 0 ) = 0. 


(4.101) 


Now consider the function / defined by 

/(*) = cj^x) + c 2 / 2 (x), a < x < b. 

By Theorem 4.15, since and f 2 are solutions of differential equation (4.91), this 
function / is also a solution of Equation (4.91). From (4.101), we have 

f(x 0 ) = 0 and f'(x 0 ) = 0. 

Thus by Theorem A, Conclusion 2, we know that 

/(x) = 0 for all x on a < x < b. 


That is, 


cifi(x) + c 2 f 2 (x) = 0 


for all x on a < x < b, where and c 2 are not both zero. Therefore the solutions f l and 
f 2 are linearly dependent on a < x < b. 

Part 2. We must now show that if and f 2 are linearly dependent on a < x < b, then 
their Wronskian has the value zero for all x on this interval. We thus assume that and 
f 2 are linearly dependent on a < x < b. Then there exist constants and c 2 , not both 
zero, such that 

c ifi( x ) + c 2 f 2 (x) = 0 (4.102) 

for all x on a < x < b. From (4.102), we also have 

cj\(x) + c 2 f r 2 (x) = 0 (4.103) 


for all x on a < x < b. Now let x = x 0 be an arbitrary point of the interval a< x<b. 
Then (4.102) and (4.103) hold at x = x 0 . That is, 

C ifi(x o ) + c 2 f 2 (x o ) = 0, 

Clf'l(Xo) + C 2 f' 2 (x 0 ) = 0 , 

where and c 2 are not both zero. Thus, by Theorem B, we have 

fi(x 0 ) fz(x 0) =Q 
f ,(X 0 ) f' 2 (x 0) ■ 

But this determinant is the value of the Wronskian of /j and f 2 at x = x 0 , and x 0 is an 
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arbitrary point of a < x < b. Thus we have 

fl(x) f 2 (x) 
f'l(x) f'i(x) 

for all x on a < x < b. 


QE.D . 


THEOREM 4.19 


The value of the Wronskian of two solutions f x and f 2 of differential equation (4.91) 
either is zero for all x on a < x< b or is zero for no x on a < x < b. 

Proof. If f x and f 2 are linearly dependent on a < x < b, then by Theorem 4.18, the 
value of the Wronskian of f x and f 2 is zero for all x on a < x < b. 

Now let f x and f 2 be linearly independent on a < x < b\ and let W denote the 
Wronskian of f x and / 2 , so that 


fl(x) f 2 (x) 
f'l(x) f'lix) 


Differentiating this, we obtain 


w , ( , f\(x) f' 2 (x) /,(*) f 2 (x) 

’ /',(*) f' 2 (x) + f'l(x) f'i(x) 


and this reduces at once to 


W(x )= f lW fl(x) 

1 ’ f’Ux) f 2 (x) 


(4.104) 


Since and f 2 are solutions of differential equation (4.91), we have, respectively, 
«oM/i'W + a i(x)f\(x) + a 2 (x)f l (x) = 0, 
a 0 (x)f 2 (x) + a l (x)f 2 (x) + a 2 (x)f 2 (x) = 0, 

and hence 

/’,■(*) - 

«oW 

a o( x ) 

on a < x < b. Substituting these expressions into (4.104), we obtain 

I fi(x) f 2 (x) I 


W(X)= 

a 0 (x) a 0 (x) 

This reduces at once to 


a 0 (x) a 0 (x) 



fi(x) 

AM 


fl(x) 

f 2 (x) 

W’(x) = 

floW 

-“fi/i w 

«oW 

+ 

-“f’/.w 

«oM 

fl oW 



176 EXPLICIT METHODS OF SOLVING HIGHER-ORDER LINEAR DIFFERENTIAL EQUATIONS 


and since the last determinant has two proportional rows, this in turn reduces to 


which is simply 


W'(x) = - 


fli(x) 


a oM 


/iM /2W 
/iW /'2W 


W'{x) = W(x). 

a 0 (x) 


Thus the Wronskian W satisfies the first-order homogeneous linear differential 
equation 


dW a { (x) 
dx a 0 (x) 


W = 0. 


Integrating this from x 0 to x, where x 0 is an arbitrary point of a < x < b, we obtain 

W(x) = c expT— f . 

L Jx„flo (t) J 


Letting x = x 0 , we find that c = W(x 0 ). Hence we obtain the identity 


(4.105) 


valid for all x on a < x < b, where x 0 is an arbitrary point of this interval. 

Now assume that W(x 0 ) = 0. Then by identity (4.105), we have W(x) = Ofor all x on 
a < x < b. Thus by Theorem 4.18, the solutions f x and f 2 must be linearly dependent 
on a < x < b. This is a contradiction, since and f 2 are linearly independent. 
Therefore the assumption that W(x 0 ) = 0 is false, and so W(x 0 ) # 0. But x 0 is an 
arbitrary point of a < x < b. Thus W(x) is zero for no x on a < x < b. 

Q.E.D. 


THEOREM 4.20 

Hypothesis. Let and f 2 be any two linearly independent solutions of differential 
equation (4.91) on a < x < b. 

Conclusion. Then every solution f of differential equation (4.91) can be expressed as 
a suitable linear combination 


Cl/l +C2/2 

of these two linear independent solutions. 

Proof. Let x 0 be an arbitrary point of the interval a < x < b, and consider the 
following system of two linear algebraic equations in the two unknowns k x and k 2 : 

kifi(x 0 ) + k 2 f 2 (x 0 ) = f(x 0 ), 

kj\(x 0 ) + k 2 f' 2 (x 0 ) = f'(x 0 ). ( 4106 ) 

Since f { and f 2 are linearly independent on a < x < b, we know by Theorem 4.17 that 
the value of the Wronskian of f x and f 2 is different from zero at some point of this 
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interval. Then by Theorem 4.19 the value of the Wronskian is zero for no x on 
a < x < b and hence its value at x 0 is not zero. That is, 


/ i(*o) 
f\ |(X 0 ) 


f 2(*o) 

fiixo) 


#0. 


Thus by Theorem C, the algebraic system (4.106) has a unique solution fcj = Cj and 
k 2 = c 2 . Thus for k t = Ci and k 2 = c 2 , each left member of system (4.106) is the same 
number as the corresponding right member of (4.106). That is, the number Cj/^Xq) + 
c 2 / 2 (x 0 ) is equal to the number f(x 0 ), and the number c^'^Xq) + c 2 /' 2 (x 0 )isequalto 
the number f'(x 0 ). But the numbers c^fxo) + c 2 / 2 (x 0 )andc 1 /' 1 (x 0 ) + c 2 /' 2 (x 0 )are 
the values of the solution c y f x + c 2 f 2 and its first derivative, respectively, at x 0 ; and the 
numbers /(x 0 ) and /'(x 0 ) are the values of the solution / and its first derivative, 
respectively, at x 0 . Thus the two solutions Cih + c 2 f 2 and / have equal values and 
their first derivative also have equal values at x 0 . Hence by Theorem A, Conclusion 1, 
we know that these two solutions are identical throughout the interval a < x < b. That 
is, 


fix) = CiMx) + c 2 / 2 (x) 


for all x on a < x < b, and so / is expressed as a linear combination of and f 2 . 

Q.E.D. 


Exercises 


1. Consider the second-order homogenous linear differential equation 


<^_y , 

dx 2 dx 


+ 2y = 0. 


(a) Find the two linearly independent solutions h and f 2 of this equation which 
are such that 

/i (0) = 1 and f\ (0) = 0 

and 

hi 0) = 0 and f' 2 ( 0)=1. 

(b) Express the solution 

3e x + 2e 2x 

as a linear combination of the two linearly independent solutions and f 2 
defined in part (a). 

2. Consider the second-order homogeneous linear differential equation 

a °^5x^ + ai ^5x + a2 ^ y = °’ ^ 

where a 0 , a u and a 2 are continuous on a real interval a < x < b, and a 0 (x) ^ 0 for 
all x on this interval. Let f x and f 2 be two distinct solutions of differential equa- 
tion (A) on a < x < b, and suppose / 2 (x) # 0 for all x on this interval. Let 
W[f t(x), / 2 (x)] be the value of the Wronskian of h and f 2 at x. 
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(a) Show that 

d [7i(*)l WUMJAxft 

dxlfAx)] Ui(x)h 2 
for all x on a < x < b. 

(b) Use the result of part (a) to show that if W[f t (x), / 2 (x)] = 0 for all x such that 
a < x < b, then the solutions fy and f 2 are linearly dependent on this interval. 

(c) Suppose the solutions fy and f 2 are linearly independent on a < x < b, and let 
/be the function defined by f(x) = fy (x)/ / 2 (x), a<x<b. Show that / is a 
monotonic function on a < x < b. 

3. Let f y and f 2 be two solutions of the second-order homogeneous linear differential 
equation (A) of Exercise 2. 

(a) Show that if fy and f 2 have a common zero at a point x 0 of the interval 
a < x < b, then fy and f 2 are linearly dependent on a < x < b. 

(b) Show that if fy and f 2 have relative maxima at a common point x 0 of the 
interval a < x < b, then f x and f 2 are linearly dependent on a < x < b. 

4. Consider the second-order homogeneous linear differential equation (A) of Exer- 
cise 2. 

(a) Let and f 2 be two solutions of this equation. Show that if and f 2 are 
linearly independent on a < x < b and A y , A 2 , By , and B 2 are constants such 
that AyB 2 — A 2 By / 0, then the solutions Ayfy + A 2 f 2 andByfy + B 2 f 2 of 
Equation (A) are also linearly independent on a < x < b. 

(b) Let {fy, f 2 } be one set of two linearly independent solutions of Equation (A) 
on a < x < b, and let {g { , g 2 } be another set of two linearly independent 
solutions of Equation (A) on this interval. Let W[fy(x), / 2 (x)] denote the 
value of the Wronskian of fy and f 2 at x, and let W[gy(x), ^ 2 (x)] denote the 
value of the Wronskian of g { and g 2 at x. Show that there exists a constant 
c 0 such that 

^[/lW,/ 2 (x)]=C^[^ 1 (x), 02 (x)] 
for all x on a < x < b. 

5. Let fy and f 2 be two solutions of the second-order homogeneous linear differential 
equation (A) of Exercise 2. Show that if fy and f 2 are linearly independent on 
a < x <b and are such that f\{x 0 ) = fii* o) = 0 at some point x 0 of this interval, 
then ^(xq) = a 2 (x 0 ) = 0. 



= CHAPTER FIVE= 

Applications of Second-Order Linear Differential Equations 
with Constant Coefficients 


Higher-order linear differential equations, which were introduced in the previous 
chapter, are equations having a great variety of important applications. In particular, 
second-order linear differential equations with constant coefficients have numerous 
applications in physics and in electrical and mechanical engineering. Two of these 
applications will be considered in the present chapter. In Sections 5. 1-5.5 we shall 
discuss the motion of a mass vibrating up and down at the end of a spring, while in 
Section 5.6 we shall consider problems in electric circuit theory. 


5.1 THE DIFFERENTIAL EQUATION OF THE VIBRATIONS OF A MASS 
ON A SPRING 

The Basic Problem 

A coil spring is suspended vertically from a fixed point on a ceiling, beam, or other 
similar object. A mass is attached to its lower end and allowed to come to rest in an 
equilibrium position. The system is then set in motion either (1) by pulling the mass 
down a distance below its equilibrium position (or pushing it up a distance above it) 
and subsequently releasing it with an initial velocity (zero or nonzero, downward or 
upward) at t = 0; or (2) by forcing the mass out of its equilibrium position by giving it a 
nonzero initial velocity (downward or upward) at t = 0. Our problem is to determine 
the resulting motion of the mass on the spring. In order to do so we must also consider 
certain other phenomena that may be present. For one thing, assuming the system is 
located in some sort of medium (say “ordinary” air or perhaps water), this medium 
produces a resistance force that tends to retard the motion. Also, certain external forces 
may be present. For example, a magnetic force from outside the system may be acting 
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upon the mass. Let us then attempt to determine the motion of the mass on the spring, 
taking into account both the resistance of the medium and possible external forces. We 
shall do this by first obtaining and then solving the differential equation for the motion. 

In order to set up the differential equation for this problem we shall need two laws of 
physics: Newton’s second law and Hooke’s law. Newton’s second law was encountered 
in Chapter 3, and we shall not go into a further discussion of it here. Let us then recall 
the other law that we shall need. 


Hooke's Law 

The magnitude of the force needed to produce a certain elongation of a spring is 
directly proportional to the amount of this elongation, provided this elongation is not 
too great. In mathematical form, 

\F\ = ks, 

where F is the magnitude of the force, s is the amount of elongation, and k is a constant 
of proportionality which we shall call the spring constant. 

The spring constant k depends upon the spring under consideration and is a measure 
of its stiffness. For example, if a 30-lb weight stretches a spring 2 ft, then Hooke’s law 
gives 30 = (k)( 2); thus for this spring k = 15 lb/ft. 

When a mass is hung upon a spring of spring constant k and thus produces an 
elongation of amount s, the force F of the mass upon the spring therefore has magni- 
tude ks. The spring at the same time exerts a force upon the mass called the restoring 
force of the spring. This force is equal in magnitude but opposite in sign to F and 
hence has magnitude — ks. 

Let us formulate the problem systematically. Let the coil spring have natural 
(unstretched) length L. The mass m is attached to its lower end and comes to rest in its 
equilibrium position, thereby stretching the spring an amount / so that its stretched 
length is L + Z. We choose the axis along the line of the spring, with the origin 0 at the 
equilibrium position and the positive direction downward. Thus, letting x denote the 
displacement of the mass from 0 along this line, we see that x is positive, zero, or 
negative according to whether the mass is below, at, or above its equilibrium position. 
(See Figure 5.1.) 


Forces Acting Upon the Mass 

We now enumerate the various forces that act upon the mass. Forces tending to pull the 
mass downward are positive, while those tending to pull it upward are negative. The 
forces are: 

1. F 1? the force of gravity , of magnitude mg, where g is the acceleration due to 
gravity. Since this acts in the downward direction, it is positive, and so 

F\ = mg. (5.1) 

2. F 2 , the restoring force of the spring. Since x + Z is the total amount of elongation, 
by Hooke’s law the magnitude of this force is k(x + Z). When the mass is below the end 
of the unstretched spring, this force acts in the upward direction and so is negative. Also, 
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stretched to length 
L + l + x 


Figure 5.1 


for the mass in such a position, x + / is positive. Thus, when the mass is below the end of 
the unstretched spring, the restoring force is given by 

F 2 = -k(x + /). (5.2) 

This also gives the restoring force when the mass is above the end of the unstretched 
spring, as one can see by replacing each italicized word in the three preceding sentences 
by its opposite. When the mass is at rest in its equilibrium position the restoring force F 2 
is equal in magnitude but opposite in direction to the force of gravity and so is given by 
— mg. Since in this position x = 0, Equation (5.2) gives 

-mg = -k{ 0 + /) 


or 


mg = kl. 

Replacing kl by mg in Equation (5.2) we see that the restoring force can thus be written 
as 


F 2 = —kx — mg. 


(5.3) 


3. F 3 , the resisting force of the medium, called the damping force. Although the 
magnitude of this force is not known exactly , it is known that for small velocities it is 
approximately proportional to the magnitude of the velocity: 


\F 3 \ = a 


dx 

It 


(5.4) 


where a > 0 is called the damping constant. When the mass is moving downward , F 3 acts 
in the upward direction (opposite to that of the motion) and so F 3 < 0. Also, since m is 
moving downward , x is increasing and dx/dt is positive. Thus, assuming Equation (5.4) 
to hold, when the mass is moving downward , the damping force is given by 


^3 = 



(a > 0). 


(5.5) 
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This also gives the damping force when the mass is moving upward , as one may see by 
replacing each italicized word in the three preceding sentences by its opposite. 

4. F 4 , any external impressed forces that act upon the mass. Let us denote the 
resultant of all such external forces at time t simply by F(t) and write 


h = m 


(5.6) 


We now apply Newton’s second law, F = ma , where F = F { + F 2 + F 3 + F 4 . Using 
(5. 1), (5.3), (5.5), and (5.6), we find 

d 2 x dx 

m — y = mg — kx — mg — a - — b F(t) 
dt dt 


or 


d 2 x dx 

m !t T + a ~dt +kx = F ( 5 - 7 ) 

This we take as the differential equation for the motion of the mass on the spring. 
Observe that it is a nonhomogeneous second-order linear differential equation with 
constant coefficients. If a = 0 the motion is called undamped ; otherwise it is called 
damped. If there are no external impressed forces, F(t) = 0 for all t and the motion is 
called free ; otherwise it is called forced. In the following sections we consider the 
solution of (5.7) in each of these cases. 


5.2 FREE, UNDAMPED MOTION 


We now consider the special case of free , undamped motion , that is, the case in which 
both a = 0 and F{t) = 0 for all t. The differential equation (5.7) then reduces to 


d 2 x 

m -j-y -f kx = 0, 
dt 


(5.8) 


where m(> 0) is the mass and fc(>0) is the spring constant. Dividing through by m 
and letting k/m = A 2 , we write (5.8) in the form 


d 2 x 2 
, 2 "b A x = 0. 
dt 2 


(5.9) 


The auxiliary equation 


r 2 + A 2 = 0 


has roots r = ±Xi and hence the general solution of (5.8) can be written 

x = c l sin At + c 2 cos At, (5.10) 

where c x and c 2 are arbitrary constants. 

Let us now assume that the mass was initially displaced a distance x 0 from its 
equilibrium position and released from that point with initial velocity v 0 . Then, in 
addition to the differential equation (5.8) [or (5.9)], we have the initial conditions 

*(0) = x o , (5.11) 

x'(0) = v 0 . (5.12) 
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Differentiating (5.10) with respect to t , we have 

^ = c x X cos Xt — c 2 X sin Xt . (5.13) 

at 

Applying conditions (5.11) and (5. 1 2) to Equations (5.10) and (5. 1 3), respectively, we see 
at once that 

C 2 ~ 

0,1 = v 0 . 

Substituting the values of and c 2 so determined into Equation (5.10) gives the 
particular solution of the differential equation (5.8) satisfying the conditions (5.1 1) and 
(5.12) in the form 


v 0 . . , 

x = — sm X f + x 0 cos Xt. 

A 


We put this in an alternative form by first writing it as 


where 


Then, letting 


= c — sin Xt + — cos Xt 
c c 


c = 


(v 0 /X) 


+ xl > 0 . 


= - sin 0, 


— = cos 0, 
c 


(5.14) 


(5.15) 


(5.16) 


Equation (5.14) reduces at once to 

x = c cos (Xt + 0), (5.17) 

where c is given by Equation (5.15) and 0 is determined by Equations (5.16). Since 
X = yjk/m, we now write the solution (5.17) in the form 


X = c cos 



(5.18) 


This, then, gives the displacement x of the mass from the equilibrium position O as a 
function of the time t (t > 0). We see at once that the free, undamped motion of the mass 
is a simple harmonic motion. The constant c is called the amplitude of the motion and 
gives the maximum (positive) displacement of the mass from its equilibrium position. 
The motion is a periodic motion, and the mass oscillates back and forth between x = c 
and x = — c. We have x = c if and only if 


Ik 

— t + 0 = ±2mt, 
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n = 0, 1, 2, 3, . . . ; t > 0. Thus the maximum (positive) displacement occurs if and only if 


t 


— (±2mt — </>)> 0, 
k 


(5.19) 


where n = 0, 1, 2, 3,.... 

The time interval between two successive maxima is called the period of the motion. 
Using (5.19), we see that it is given by 


2n 2n 



(5.20) 


The reciprocal of the period, which gives the number of oscillations per second, is 
called the natural frequency (or simply frequency) of the motion. The number 0 is called 
the phase constant (or phase angle). The graph of this motion is shown in Figure 5.2. 



► Example 5.1 


An 8-lb weight is placed upon the lower end of a coil spring suspended from the ceiling. 
The weight comes to rest in its equilibrium position, thereby stretching the spring 6 in. 
The weight is then pulled down 3 in. below its equilibrium position and released at 
t = 0 with an initial velocity of 1 ft/sec, directed downward. Neglecting the resistance 
of the medium and assuming that no external forces are present, determine the am- 
plitude, period and frequency of the resulting motion. 


Formulation. This is clearly an example of free, undamped motion and hence 
Equation (5.8) applies. Since the 8-lb weight stretches the spring 6 in. = j ft, Hooke’s 
law F = ks gives 8 = k(%) and so k = 16 lb/ft. Also, m = w/g = ^ (slugs), and so 
Equation (5.8) gives 


8 d 2 x 
32 li 2 


+ 16x = 0 


or 


d 2 x 

-yy + 64X = 0. 
dt 2 


(5.21) 
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Since the weight was released with a downward initial velocity of 1 ft/sec from a point 
3 in. ( = ^ ft) below its equilibrium position, we also have the initial conditions 

x(0) = x'(0) = 1. (5.22) 

Solution. The auxiliary equation corresponding to Equation (5.21) is r 2 + 64 = 0, 
and hence r = ± 8 i. Thus the general solution of the differential equation (5.21) may be 
written 


x = c x sin 8t + c 2 cos 8 1 , (5.23) 

where c t and c 2 are arbitrary constants. Applying the first of conditions (5.22) to this, 
we find c 2 = Differentiating (5.23), we have 

dx 

— = 8 <?! cos 8r — 8c 2 sin 8 1 . 
dt 


Applying the second of conditions (5.22) to this, we have 8 = 1 and hence c { = 
Thus the solution of the differential equation (5.21) satisfying the conditions (5.22) is 

x = f sin St + * cos 8 f. (5.24) 


Let us put this in the form (5.18). We find 


and thus write 


Thus, letting 


ir-er-f 


x = sin + — j— cos 8 1 


COS </) 


_V5 


5 ’ 


• a >/5 

sin 4> = 


we write the solution (5.24) in the form 


x = cos(8t + (p), 
8 


(5.25) 


(5.26) 


where 0 is determined by Equations (5.25). From these equations we find that </> « 
-0.46 radians. Taking ^5 % 2.236, the solution (5.26) is thus given approximately by 

x = 0.280 cos ( 8 1 — 0.46). 

The amplitude of the motion ^/5/8 « 0.280 (ft). By formula (5.20), the period is 
2tc/8 = te/4 (sec), and the frequency is 4/n oscillations/sec. The graph is shown in 
Figure 5.3. 

Before leaving this problem, let us be certain that we can set up initial conditions 
correctly. Let us replace the third sentence in the statement of the problem by the 
following: “The weight is then pushed up 4 in. above its equilibrium position and 
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released at t = 0, with an initial velocity of 2 ft/sec, directed upward .” The initial 
conditions (5.22) would then have been replaced by 

x(0) = -i 
x'(0) = — 2. 

The minus sign appears before the ^ because the initial position is 4 in. = 7 foot above 
the equilibrium position and hence is negative. The minus sign before the 2 is due to the 
fact that the initial velocity is directed upward , that is, in the negative direction. 


Exercises 

Note: In Exercises 1-7 neglect the resistance of the medium and assume that no 
external forces are present. 

1. A 12-lb weight is placed upon the lower end of a coil spring suspended from the 
ceiling. The weight comes to rest in its equilibrium position, thereby stretching the 
spring 1.5 in. The weight is then pulled down 2 in. below its equilibrium position 
and released from rest at t = 0. Find the displacement of the weight as a function of 
the time; determine the amplitude, period, and frequency of the resulting motion; 
and graph the displacement as a function of the time. 

2. A 16-lb weight is placed upon the lower end of a coil spring suspended vertically 
from a fixed support. The weight comes to rest in its equilibrium position, thereby 
stretching the spring 6 in. Determine the resulting displacement as a function of 
time in each of the following cases: 

(a) If the weight is then pulled down 4 in. below its equilibrium position and 
released at t = 0 with an initial velocity of 2 ft/sec, directed downward. 

(b) If the weight is then pulled down 4 in. below its equilibrium position and 
released at t = 0 with an initial velocity of 2 ft/sec, directed upward. 

(c) If the weight is then pushed up 4 in. above its equilibrium position and 
released at t = 0 with an initial velocity of 2 ft/sec, directed downward. 
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3. A 4-lb weight is attached to the lower end of a coil spring suspended from the 
ceiling. The weight comes to rest in its equilibrium position, thereby stretching the 
spring 6 in. At time t = 0 the weight is then struck so as to set it into motion with an 
initial velocity of 2 ft/sec, directed downward. 

(a) Determine the resulting displacement and velocity of the weight as functions 
of the time. 

(b) Find the amplitude, period, and frequency of the motion. 

(c) Determine the times at which the weight is 1.5 in. below its equilibrium 
position and moving downward. 

(d) Determine the times at which it is 1.5 in. below its equilibrium position and 
moving upward. 

4. A 64-lb weight is placed upon the lower end of a coil spring suspended from a rigid 
beam. The weight comes to rest in its equilibrium position, thereby stretching the 
spring 2 ft. The weight is then pulled down 1 ft below its equilibrium position and 
released from rest at t = 0. 

(a) What is the position of the weight at t = 5n/\2? How fast and which way is it 
moving at the time? 

(b) At what time is the weight 6 in. above its equilibrium position and moving 
downward? What is its velocity at such time? 

5. A coil spring is such that a 25-lb weight would stretch it 6 in. The spring is 
suspended from the ceiling, a 16-lb weight is attached to the end of it, and the 
weight then comes to rest in its equilibrium position. It is then pulled down 4 in. 
below its equilibrium position and released at t = 0 with an initial velocity of 

2 ft/sec, directed upward. 

(a) Determine the resulting displacement of the weight as a function of the time. 

(b) Find the amplitude, period, and frequency of the resulting motion. 

(c) At what time does the weight first pass through its equilibrium position and 
what is its velocity at this instant? 

6. An 8-lb weight is attached to the end of a coil spring suspended from a beam and 
comes to rest in its equilibrium position. The weight is then pulled down A feet 
below its equilibrium position and released at t = 0 with an initial velocity of 

3 ft/sec, directed downward. Determine the spring constant k and the constant A if 
the amplitude of the resulting motion is v /^and the period is tc/2. 

7. An 8-lb weight is placed at the end of a coil spring suspended from the ceiling. After 
coming to rest in its equilibrium position, the weight is set into vertical motion and 
the period of the resulting motion is 4 sec. After a time this motion is stopped, and 
the 8-lb weight is replaced by another weight. After this other weight has come to 
rest in its equilibrium position, it is set into vertical motion. If the period of this new 
motion is 6 sec, how heavy is the second weight? 

8. A simple pendulum is composed of a mass m (the bob) at the end of a straight wire 
of negligible mass and length /. It is suspended from a fixed point S (its point of 
support) and is free to vibrate in a vertical plane (see Figure 5.4). Let SP denote the 
straight wire; and let 9 denote the angle that SP makes with the vertical SP 0 at time 
t , positive when measured counterclockwise. We neglect air resistance and assume 
that only two forces act on the mass m: F { , the tension in the wire; and F 2 , the force 
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due to gravity, which acts vertically downward and is of magnitude mg. We write 
F 2 = F T + F n , where F T is the component of F 2 along the tangent to the path of m 
and F n is the component of F 2 normal to F T . Then F N = — F { and F T = -mg sin 9, 
and so the net force acting on m is F i + F 2 = F x + F T + F N = - mg sin 9 , along the 
arc P 0 P. Letting s denote the length of the arc P 0 P , the acceleration along this arc is 
d 2 s/dt 2 . Hence applying Newton’s second law, we have md 2 s/dt 2 = —mg sin 9. 
But since s = 19, this reduces to the differential equation 


,d 2 e 

ml = —mg sin 9 or 
dt 


d 2 9 g . 

~tt + t sin 9 = 0. 
dt 2 l 


(a) The equation 


cP9 

dt 2 


is a nonlinear second-order differential equation. Now recall that 


9 3 

sin 9 = 9- — 



Hence if 9 is sufficiently small, we may replace sin 9 by 9 and consider the 
approximate linear equation 


d 2 9 g 
-n; + t 9 = 0. 
dt 2 l 


Assume that 9 = 9 0 and d9/dt = 0 when t = 0. Obtain the solution of this 
approximate equation that satisfies these initial conditions and find the 
amplitude and period of the resulting solution. Observe that this period is 
independent of the initial displacement. 

(b) Now return to the nonlinear equation 


d 2 9 g . 

\ 2 T t sin 9 = 0. 
dt 2 l 
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Multiply through by 2 d6/dt , integrate, and apply the initial condition 6 = 6 0 , 
d6/dt = 0. Then separate variables in the resulting equation to obtain 

d6 

y/cos 6 — cos 6 0 

From this equation determine the angular velocity d6/dt as a function of 0. 
Note that the left member cannot be integrated in terms of elementary 
functions to obtain the exact solution 6(t) of the nonlinear differential 
equation. 
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We now consider the effect of the resistance of the medium upon the mass on the 
spring. Still assuming that no external forces are present, this is then the case of free , 
damped motion . Hence with the damping coefficient a > 0 and F(t) = 0 for all t, the basic 
differential equation (5.7) reduces to 

d 2 x dx 

m-rr + a— + fcx = 0. (5.27) 

dr dt 

Dividing through by m and putting k/m = X 2 and a/m = 2b (for convenience) we have 
the differential equation (5.27) in the form 

d 2 x dx 

77 + 2b — + X 2 x = 0. (5.28) 

dt 2 dt 

Observe that since a is positive, b is also positive. The auxiliary equation is 

r 2 + 2 br + X 2 = 0. 

Using the quadratic formula we find that the roots of (5.29) are 

Three distinct cases occur, depending upon the nature of these roots, which in turn 
depends upon the sign of b 2 — X 2 . 


(5.29) 

(5.30) 


Case 1 . Damped , Oscillatory Motion. Here we consider the case in which b < X, 
which implies that b 2 — X 2 < 0. Then the roots (5.30) are the conjugate complex 
numbers —b± yfx 2 — b 2 i and the general solution of Equation (5.28) is thus 

x = e~ bt (c l sin^/X 2 — b 2 t + c 2 coSy/X 2 — b 2 t ), (5.31) 

where c x and c 2 are arbitrary constants. We may write this in the alternative form 

x = ce~ bt cos(y/X 2 — b 2 t + </>), 


where c = <Jc\ + c\ > 0 and $ is determined by the equations 

r . 

= —sin 0, 

= cos </>. 


\J C \ + 

C 2 


r 2 

C 2 


\J C 1 + 


Cl 


(5.32) , 
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The right member of Equation (5.32) consists of two factors, 
ce~ bt and cos^A 2 — b 2 t + 0). 

The factor ce~ bt is called the damping factor , or time-varying amplitude. Since c > 0, it 
is positive; and since b > 0, it tends to zero monotonically as t oo. In other words, as 
time goes on this positive factor becomes smaller and smaller and eventually becomes 
negligible. The remaining factor, cos^A 2 — b 2 t + <£), is, of course, of a periodic, 
oscillatory character; indeed it represents a simple harmonic motion. The product of 
these two factors, which is precisely the right member of Equation (5.32), therefore 
represents an oscillatory motion in which the oscillations become successively smaller 
and smaller. The oscillations are said to be “damped out,” and the motion is described 
as damped , oscillatory motion . Of course, the motion is no longer periodic, but the time 
interval between two successive (positive) maximum displacements is still referred to as 
the period. This is given by 

2n 

The graph of such a motion is shown in Figure 5.5, in which the damping factor ce~ bt 
and its negative are indicated by dashed curves. 

The ratio of the amplitude at any time T to that at time 


T- 


2n 


one period before T is the constant 


exp 


2nb ^ 


Thus the quantity 2 t ib/JX 2 — b 2 is the decrease in the logarithm of the amplitude ce bt 
over a time interval of one period. It is called the logarithmic decrement. 



Figure 5.5 
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If we now return to the original notation of the differential equation (5.27), we see 
from Equation (5.32) that in terms of the original constants m, a, and fc, the general 
solution of (5.27) is 



Since b < X is equivalent to a/2m < yjk/m, we can say that the general solution of 
(5.27) is given by (5.33) and that damped, oscillatory motion occurs when a < 2^/ km. 
The frequency of the oscillations 



1 jk a 2 3 
2n\j m Am 2 


If damping were not present, a wo uld e qual zero and the natural frequency of an 
undamped system would be (l/2n)y/k/m. Thus the frequency of the oscillations (5.34) 
in the damped oscillatory motion (5.33) is less than the natural frequency of the 
corresponding undamped system. 


Case 2. Critical Damping. This is the case in which b = A, which implies that 
b 2 — l 1 — 0. The roots (5.30) are thus both equal to the real negative number — b , and 
the general solution of Equation (5.28) is thus 

x = {c x + c 2 t)e~ bt . (5.35) 

The motion is no longer oscillatory; the damping is just great enough to prevent 
oscillation^. Any slight decrease in the amount of damping, however, will change the 
situation back to that of Case 1 and damped oscillatory motion will then occur. Case 2 
then is a borderline case; the motion is said to be critically damped. 


From Equation (5.35) we see that 

lim x = lim Cl ° 2 1 = 0. 

pbi 

t~> oo t -* oo «- 


Hence the mass tends to its equilibrium position as t -+ oo. Depending upon the initial 
conditions present, the following possibilities can occur in this motion: 


1. The mass neither passes through its equilibrium position nor attains an extremum 
(maximum or minimum) displacement from equilibrium for t > 0. It simply 
approaches its equilibrium position monotonically as t -► oo.,(See Figure 5.6a.) 

2. The mass does not pass through its equilibrium position for t > 0, but its 
displacement from equilibrium attains a single extremum for t = T x > 0. After this 
extreme displacement occurs, the mass tends to its equilibrium position monotonically 
as t -+ oo. (See Figure 5.6 b.) 

3. The mass passes through its equilibrium position once at t = T 2 > 0 and then 
attains an extreme displacement at t = T 3 > T 2 , following which it tends to its 
equilibrium position monotonically as t -► oo. (See Figure 5.6c.) 
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H 


(a) 





Case 3. Overcritical Damping. Finally, we consider here the case in which b > 2, 
which implies that b 2 — X 1 > 0. Here the roots (5.30) are the distinct, real negative 
numbers 

r x = -b + Jb 2 - A 2 
and 

r 2 = ~b- jb 2 - A 2 . 

The general solution of (5.28) in this case is 

x = c,e ri * + c 2 e r2t . (5.36) 

The damping is now so great that no oscillations can occur. Further, we can no longer 
say that every decrease in the amount of damping will result in oscillations, as we could 
in Case 2. The motion here is said to be overcritically damped (or simply overdamped). 

Equation (5.36) shows us at once that the displacement x approaches zero as t -► oo. 
As in Case 2 this approach to zero is monotonic for t sufficiently large. Indeed, the three 
possible motions in Cases 2 and 3 are qualitatively the same. Thus the three motions 
illustrated in Figure 5.6 can also serve to illustrate the three types of motion possible in 
Case 3. 

► Example 5.2 

A 32-lb weight is attached to the lower end of a coil spring suspended from the ceiling. 
The weight comes to rest in its equilibrium position, thereby stretching the spring 2 ft. 
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The weight is then pulled down 6 in. below its equilibrium position and released at 
t = 0. No external forces are present; but the resistance of the medium in pounds is 
numerically equal to 4 (dx/dt), where dx/dt is the instantaneous velocity in feet per 
second. Determine the resulting motion of the weight on the spring. 


Formulation. This is a free, damped motion and Equation (5.27) applies. Since 
the 32-lb weight stretches the spring 2 ft, Hooke’s law, F = ks , gives 32 = k(2) and 
so k = 16 lb/ft. Thus, since m = w/g = y§ = 1 (slug), and the damping constant 
a = 4, Equation (5.27) becomes 


d 2 x dx 

- 2 “1" 4 — — K 16x — 0. 
dt 2 dt 


(5.37) 


The initial conditions are 

*(0) = i 

x'(0) = 0. 


(5.38) 


Solution. The auxiliary equation of Equation (5.37) is 

r 2 + 4r + 16 = 0. 

Its roots are the conjugate complex numbers — 2 ± 2^/3 i. Thus the general solution of 
(5.37) may be written 

x = e~ 2t (c l sin 2^/3 1 + c 2 cos 2^/3 t), (5.39) 

where c l and c 2 are arbitrary constants. Differentiating (5.39) with respect to t we 
obtain 


~ = e~ 2, [( — 2ci - 2y3c 2 )sin 2^3 1 + (2 y /3c t - 2c 2 )cos 2^0- (5-40) 

Applying the initial conditions (5.38) to Equations (5.39) and (5.40), we obtain 

r - 1 
C 2 ~ 2 > 

2 x /3ci — 2 c 2 = 0. 

Thus c x = y/3/6, c 2 = \ and the solution is 

x = e -»(Jl sin 2^31 + ^008 273^. (5.41) 

Let us put this in the alternative form (5.32). We have 

2^- sin 2^/3 1 + ^ cos 2^/3 1 = |^- sin 2 x /3 1 + cos 2y/3t J 




cos 




Thus the solution (5.41) may be written 


= ^e 2 ‘ cos (ly/lt - ^ 


(5.42) 
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Interpretation. This is a damped oscillatory motion (Case 1). The damping factor is 
{ y /3/3)e~ 2t , the “period” is 271/2^3 = ^371/3, and the logarithmic decrement is 
2 x /3tc/ 3. The graph of the solution (5.42) is shown in Figure 5.7, where the curves x = 
±( v /3/3)e~ 2f are drawn dashed. 

► Example 5.3 

Determine the motion of the weight on the spring described in Example 5.2 if the 
resistance of the medium in pounds is numerically equal to 8 (dx/dt) instead of 4 (dx/dt) 
(as stated there), all other circumstances being the same as stated in Example 5.2. 

Formulation. Once again Equation (5.27) applies, and exactly as in Example 5.2 we 
find that m = 1 (slug) and k = 16 lb/ft. But now the damping has increased, and we 
have a = 8. Thus Equation (5.27) now becomes 

d 2 x dx 

— 2 “I" ^ 1 — 1" 16x = 0. (5.43) 

dt dt 


The initial conditions 

x(0) = i, 
x'(0) = 0, 

are, of course, unchanged from Example 5.2. 


(5.44) 


Solution. The auxiliary equation is now 

r 2 + 8r + 16 = 0 

and has the equal roots r = -4, -4. The general solution of Equation (5.43) is thus 

x = (c t + c 2 t)e~ 4t , (5.45) 



Figure 5.7 
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where c t and c 2 are arbitrary constants. Differentiating (5.45) with respect to t , we have 

dx 

— = (c 2 -4c i - 4 c 2 t)e~ 4t . (5.46) 

Applying the initial conditions (5.44) to Equations (5.45) and (5.46) we obtain 

C 1 = 25 

c 2 — 4 Cj = 0. 

Thus c t = j, c 2 = 2 and the soltuion is 

x = (i + 2 t)e~ 4 ‘. (5.47) 

Interpretation. The motion is critically damped. Using (5.47), we see that x = 0 if 
and only if t = — J. Thus x # 0 for t > 0 and the weight does not pass through its 
equilibrium position. Differentiating (5.47) one finds at once that dx/dt < 0 for all 
t > 0. Thus the displacement of the weight from its equilibrium position is a decreas- 
ing function of t for all t > 0. In other words, the weight starts to move back toward 
its equilibrium position at once and x -► 0 monotonically as t -> oo. The motion is 
therefore an example of possibility 1 described in the general discussion of Case 2 
above. The graph of the solution (5.47) is shown as the solid curve in Figure 5.8. 



Figure 5.8 

► Example 5.4 

Determine the motion of the weight on the spring described in Example 5.2 if the re- 
sistance of the medium in pounds is numerically equal to 10 (dx/dt) instead of 4 (dx/dt) 
(as stated there), all other circumstances being the same as stated in Example 5.2. 

Formulation. The only difference between this and the two previous examples is in 
the damping constant. In Example 5.2, a = 4; in Example 5.3, a = 8; and now we have 
even greater damping, for here a = 10. As before m = 1 (slug) and k = 16 lb/ft. The 
differential equation (5.27) thus becomes 

^4+ 10^+ 16x = 0. (5.48) 

dt 2 dt 

The initial conditions (5.44) or (5.38) still hold. 
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Solution. The auxiliary equation is now 

r 2 + lOr + 16 = 0 

and its roots are r = — 2, —8. Thus the general solution of Equation (5.48) is 

x = c x e~ 2t + c 2 e~ 8 \ 

where c x and c 2 are arbitrary constants. Differentiating this with respect to t to obtain 

^ = — 2c x e~ 2t — 8c 2 e~ 8r 
at 

and applying the initial conditions (5.44), we find the following equations for the 
determination of c x and c 2 : 

c i + c 2 = h 

— 2c l — 8c 2 = 0. 

The solution of this system is c x = § , c 2 = — thus the solution of the problem is 

x= l e ~2t _i e -st ( 5 . 49 ) 

Interpretation. Clearly the motion described by Equation (5.49) is an example of 
the overdamped case (Case 3). Qualitatively the motion is the same as that of the 
solution (5.47) of Example 5.3. Here, however, due to the increased damping, the weight 
returns to its equilibrium position at a slower rate. The graph of (5.49) is shown as the 
dashed curve in Figure 5.8. Note that in each of Examples 5.2, 5.3, and 5.4, all 
circumstances (the weight, the spring, and the initial conditions) were exactly the same, 
except for the damping. In Example 5.2, the damping constant a = 4, and the resulting 
motion was the damped oscillatory motion shown in Figure 5.7. In Example 5.3 the 
damping was increased to such an extent ( a = 8) that oscillations no longer occurred, 
the motion being shown by the solid curve of Figure 5.8. Finally in Example 5.4 the 
damping was further increased {a = 10) and the resulting motion, indicated by the 
dashed curve of Figure 5.8, was similar to but slower than that of Example 5.3. 


Exercises 

1. An 8-lb weight is attached to the lower end of a coil spring suspended from the 
ceiling and comes to rest in its equilibrium position, thereby stretching the spring 
0.4 ft. The weight is then pulled down 6 in. below its equilibrium position and 
released at t = 0. The resistance of the medium in pounds is numerically equal to 
2 (dx/dt), where dx/dt is the instantaneous velocity in feet per second. 

(a) Set up the differential equation for the motion and list the initial conditions. 

(b) Solve the initial-value problem set up in part (a) to determine the 
displacement of the weight as a function of the time. 

(c) Express the solution found in step (b) in the alternative form (5.32) of the text. 

(d) What is the so-called “period” of the motion? 

(e) Graph the displacement as a function of the time. 
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2. A 16-lb weight is placed upon the lower end of a coil spring suspended from the 
ceiling and comes to rest in its equilibrium position, thereby stretching the spring 
8 in. At time t = 0 the weight is then struck so as to set it into motion with an 
initial velocity of 2 ft/sec, directed downward. The medium offers a resistance in 
pounds numerically equal to 6 (dx/dt), where dx/dt is the instantaneous velocity 
in feet per second. Determine the resulting displacement of the weight as a func- 
tion of time and graph this displacement. 

3. An 8-lb weight is attached to the lower end of a coil spring suspended from a fixed 
support. The weight comes to rest in its equilibrium position, thereby stretching 
the spring 6 in. The weight is then pulled down 9 in. below its equilibrium position 
and released at t = 0. The medium offers a resistance in pounds numerically equal 
to 4 (dx/dt), where dx/dt is the instantaneous velocity in feet per second. 
Determine the displacement of the weight as a function of the time and graph this 
displacement. 

4. A 16-lb weight is attached to the lower end of a coil spring suspended from a fixed 
support. The weight comes to rest in its equilibrium position, thereby stretching 
the spring 6 in. The weight is then pulled down 3 in. below its equilibrium position 
and released at t = 0. The medium offers a resistance in pounds numerically equal 
to 10 (dx/dt), where dx/dt is the instantaneous velocity in feet per second. 
Determine the displacement of the weight as a function of the time and graph this 
displacement. 

5. A spring is such that a force of 20 lb would stretch it 6 in. The spring hangs 
vertically and a 4-lb weight is attached to the end of it. After this 4-lb weight comes 
to rest in its equilibrium position it is pulled down 8 in. below this position and 
then released at t = 0. The medium offers a resistance in pounds numerically 
equal to 2 (dx/dt), where dx/dt is the instantaneous velocity in feet per second. 

(a) Determine the displacement of the weight as a function of the time and 
express this displacement in the alternative form (5.32) of the text. 

(b) Find the so-called “period” and determine the logarithmic decrement. 

(c) At what time does the weight first pass through its equilibrium position? 

6. A 4-lb weight is hung upon the lower end of a coil spring hanging vertically from a 
fixed support. The weight comes to rest in its equilibrium position, thereby 
stretching the spring 8 in. The weight is then pulled down a certain distance below 
this equilibrium position and released at t = 0. The medium offers a resistance in 
pounds numerically equal to a(dx/dt ), where a > 0 and dx/dt is the instantaneous 
velocity in feet perj>econd. Show that the motion is oscillatory if a < <v /3, critically 
damped if a = y/3, and overdamped if a > y/3. 

7. A 4-lb weight is attached to the lower end of a coil spring that hangs vertically 
from a fixed support. The weight comes to rest in its equilibrium position, thereby 
stretching the spring 6 in. The weight is then pulled down 3 in. below this 
equilibrium position and released at t = 0. The medium offers a resistance in 
pounds numerically equal to a(dx/dt ), where a > 0 and dx/dt is the instantaneous 
velocity in feet per second. 

(a) Determine the value of a such that the resulting motion would be critically 
damped and determine the displacement for this critical value of a. 
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(b) Determine the displacement if a is equal to one-half the critical value found 
in step (a). 

(c) Determine the displacement if a is equal to twice the critical value found in 
step (a). 

8. A 10-lb weight is attached to the lower end of a coil spring suspended from the 
ceiling, the spring constant being 20 lb/ft. The weight comes to rest in its 
equilibrium position. It is then pulled down 6 in. below this position and released 
at t = 0 with an initial velocity of 1 ft/sec, directed downward. The resistance of 
the medium in pounds is numerically equal to a(dx/dt ), where a > 0 and dx/dt is 
the instantaneous velocity in feet per second. 

(a) Determine the smallest value of the damping coefficient a for which the 
motion is not oscillatory. 

(b) Using the value of a found in part (a) find the displacement of the weight as a 
function of the time. 

(c) Show that the weight attains a single extreme displacement from its 
equilibrium position at time t = determine this extreme displacement, 
and show that the weight then tends monotonically to its equilibrium 
position as t -► oo. 

(d) Graph the displacement found in step (b). 

9. A 32-lb weight is attached to the lower end of a coil spring suspended from the 
ceiling. After the weight comes to rest in its equilibrium position, it is then pulled 
down a certain distance below this position and released at t = 0. If the medium 
offered no resistance, the natural frequency of the resulting undamped motion 
would be 4/ u cycles per second. However, the medium does offer a resistance in 
pounds numerically equal to a(dx/dt ), where a > 0 and dx/dt is the instantaneous 
velocity in feet per second; and the frequency of the resulting damped oscillatory 
motion is only half as great as the natural frequency. 

(a) Determine the spring constant k. 

(b) Find the value of the damping coefficient a . 

10. The differential equation for the vertical motion of a mass m on a coil spring of 
spring constant k in a medium in which the damping is proportional to the 
instantaneous velocity is given by Equation (5.27). In the case of damped 
oscillatory motion the solution of this equation is given by (5.33). Show that the 
displacement x so defined attains an extremum (maximum or minimum) at the 
times t n (n = 0, 1,2,...) given by 

1 


where 


k a 2 
m Am 2 ’ 




11 . 


The differential equation for the vertical motion of a unit mass on a certain coil 
spring in a certain medium is 


d 2 x 

dt 2 


+ 


^ dx . 2 
2b— - + b 2 x = 
dt 


0 , 
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where b > 0. The initial displacement of the mass is A feet and its initial velocity is 

B feet per second. 

(a) Show that the motion is critically damped and that the displacement is given 
by 

x = (A + Bt + Abt)e~ bt . 

(b) If A and B are such that 

A B 

B + Ab an b{B + Ab) 

are both negative, show that the mass approaches its equilibrium position 
monotonically as t -* oo without either passing through this equilibrium 
position or attaining an extreme displacement from it for t > 0. 

(c) If A and B are such that —A/(B + Ab) is negative but B/b(B -f Ab) is 
positive, show that the mass does not pass through its equilibrium position 
for t > 0, that its displacement from this position attains a single extremum 
at t = B/b(B + Ab), and that thereafter the mass tends to its equilibrium 
position monotonically as t -► oo. 

(d) If A and B are such that — A/(B + Ab) is positive, show that the mass passes 
through its equilibrium position at t = —A/(B + Ab), attains an extreme 
displacement at t = B/b(B + Ab), and thereafter tends to its equilibrium 
position monotonically as t -> oo. 


5.4 FORCED MOTION 


We now consider an important special case of forced motion. That is, we not only 
consider the effect of damping upon the mass on the spring but also the effect upon it of 
a periodic external impressed force F defined by F(t) = F t cos cot for all t > 0, where F } 
and (o are constants. Then the basic differential equation (5.7) becomes 


m + a ^ + kx = F i cos wt. 
dt 2 dt 


(5.50) 


Dividing through by m and letting 



m 




1 5 


this takes the more convenient form 


+2 b -y- + A 2 x = E t COS COt. 

dt 2 dt 


(5.51) 


We shall assume that the positive damping constant a is small enough so that the 
damping is less than critical. In other words we assume that b < X. Hence by Equation 
(5.32) the complementary function of Equation (5.51) can be written 

x c = ce~ bt cos {yj X 2 — b 2 t -f 0). 


(5.52) 
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We shall now find a particular integral of (5.51) by the method of undetermined 
coefficients. We let 

x p = A cos cot + B sin cot. (5.53) 

Then 


dxp 

dt 

d 2 x p 
dt 2 


— co A sin cot + coB cos cot , 

— co 2 A cos cot — co 2 B sin cot. 


Substituting into Equation (5.51), we have 

[ — IbcoA + (X 2 — co 2 )B ~\ sin cot + [(A 2 - co 2 )A + 2ko£]cos cot = E x cos cot. 
Thus, we have the following two equations from which to determine A and B: 


Solving these, we obtain 


— IbcoA + (A 2 — co 2 )B = 0, 
(A 2 — co 2 ) A + 2bcoB = E l . 

£,(A 2 ~ co 2 ) 

[X 2 - to 2 ) 2 + 4b 2 o> 2 \ 

_ IbwEi 

(X 2 - co 2 ) 2 + 4b 2 co 2 ' 


(5.54) 


Substituting these values into Equation (5.53), we obtain a particular integral in the 
form 

£ 

Xp = (A 2 - o) 2 ) 2 + 4 bW [(A2 " " 2)cos 0)1 + 2bco sin 


We now put this in the alternative “phase angle” form; we write 
(A 2 — co 2 )cos cot + 2 bco sin cot 


= ^/(A 2 - co 2 ) 2 + 4 b 2 


A 2 — co 2 


,J(k 2 - co 2 ) 2 + 4b 2 o) 2 

+ 


cos cot 


2bco 


yj{)} - co 2 ) 2 + 4b 


2 co 2 


sin cot 


— co 2 ) 2 + 4b 2 co 2 [cos cot cos 9 + sin cot sin 0] 
= ^/(A 2 — o 2 ) 2 + 4 b 2 ft> 2 cos (art - 9), 


where 


cos 9 = 


sin 9 = 


A 2 - co 2 


^/(A 2 - co 2 ) 2 + 4b 2 co 2 ’ 

2ko 

^/(A 2 - o) 2 ) 2 + 4h 2 ft) 2 


(5.55) 
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Thus the particular integral appears in the form 


E i 


J(X 2 - O) 2 ) 1 + 4b 2 or 


cos (cot — 6 ), 


(5.56) 


where 6 is determined from Equations (5.55). Thus, using (5.52) and (5.56) the general 
solution of Equation (5.51) is 


= x c + x p = ce bt cos^A 2 — b 2 t + </>) + 


£i 


ya 2 - co 2 ) 2 + 4b 2 or 


: cos (cot — 6). 


(5.57) 


Observe that t his solution is the sum of two terms. The first term, 
ce~ bt cos(Jh 2 — b 2 t + 0), represents the damped oscillation that would be the entire 
motion of the system if the external force F t cos cot were not present. The second term, 


E i 

J(X 2 - co 2 ) 2 + 4 b 2 co 2 


cos (cot — 0), 


which results from the presence of the external force, represents a simple harmonic 
motion of period In/oo. Because of the damping factor ce~ bt the contribution of the 
first term will become smaller and smaller as time goes on and will eventually become 
negligible. The first term is thus called the transient term. The second term, however, 
being a cosine term of constant amplitude, continues to contribute to the motion in a 
periodic, oscillatory manner. Eventually, the transient term having become relatively 
small, the entire motion will consist essentially of that given by this second term. This 
second term is thus called the steady-state term. 


► Example 5.5 

A 16-lb weight is attached to the lower end of a coil spring suspended from the ceiling, 
the spring constant of the spring being 10 lb/ft. The weight comes to rest in its 
equilibrium position. Beginning at t = 0 an external force given by F(t) = 5 cos 2 1 is 
applied to the system. Determine the resulting motion if the damping force in pounds is 
numerically equal to 2 (dx/dt), where dx/dt is the instantaneous velocity in feet per 
second. 


Formulation. The basic differential equation for the motion is 


d 2 X 

m + a — + kx = F(t). 


dx 

dt 


(5.58) 


Here m = w/g = j§ = \ (slug), a = 2, k = 10, and F(t) = 5 cos It. Thus Equation (5.58) 
becomes 


1 d 2 x ^ dx 

- — ~t + 2 — + lOx = 5 cos It 

2 dt 2 dt 


or 


d 2 x dx 

— ~y + 4 — + 20x = 10 cos 2 1. 
dt 2 dt 


(5.59) 
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The initial conditions are 

x(0) = 0, 
x'(0) = 0. 


(5.60) 


Solution. The auxiliary equation of the homogeneous equation corresponding to 
(5.59) is r 2 + 4r + 20 = 0; its roots are — 2 ± A i. Thus the complementary function of 
Equation (5.59) is 

x c = e~ 2t (c l sin At + c 2 cos At), 

where c l and c 2 are arbitrary constants. Using the method of undetermined coefficients 
to obtain a particular integral, we let 

x p = A cos 2t + B sin 2 1. 

Upon differentiating and substituting into (5.59), we find the following equations for 
the determination of A and B. 

-8 A + 16J3 = 0, 

\6A + 8J 3= 10. 

Solving these, we find 

A=h B=l 

Thus a particular integral is 

x p =i cos 2t + i sin 2 1 
and the general solution of (5.59) is 

x = x c + x p = e~ 2t (c l sin At + c 2 cos 4t) + j cos 2 1 + i sin 2 1. (5.61) 

Differentiating (5.61) with respect to t, we obtain 
dx 

— = e~ 2t l( — 2c l — 4c 2 )sin At + ( — 2c 2 + 4c t )cos At~\ - sin -It + |cos 2 1. (5.62) 
Applying the initial conditions (5.60) to Equations (5.61) and (5.62), we see that 

c 2 + 2 = 0, 

4c ! - 2 c 2 + \ = 0. 

From these equations we find that 

r - _3 _ _i 

C 1 — 8 ’ c 2 ~ 2 - 

Hence the solution is 

x = c~ 2 '(-| sin At - j cos At) + j cos 2t + | sin It. (5.63) 
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Also, we have 


2 cos It + sin It = yfl cos 2 1 + sin 2t J = ^/5 cos(2t — 6 ), 


where 


2 1 
cos 6 = — — , sin 6 = — — . 

x/5 75 


Thus we may write the solution (5.63) as 

5e~ 2t 


x = 


77 


g cos(4r — 0) + cos(2r — 0), 


(5.65) 


(5.66) 


where </> and 6 are determined by Equations (5.64) and (5.65), respectively. We find that 
(j) % 0.64 (rad) and 9 % 0.46 (rad). Thus the solution (5.66) is given approximately by 

x = - 0.63c ~ 2 ' cos(4f — 0.64) + 0.56 cos(2t — 0.46). 


Interpretation. The term 


5c “ 2r 
8 


cos(4 1 


0) 


-0.63c 2t cos(4 1 — 0.64) 


is the transient term, representing a damped oscillatory motion. It becomes negligible in 
a short time; for example, for t > 3, its numerical value is less than 0.002. Its graph is 
shown in Figure 5.9a. The term 


75 


cos(2 1 - 6) % 0.56 cos(2t — 0.46) 


is the steady-state term, representing a simple harmonic motion of amplitude 



and period n. Its graph appears in Figure 5.9 b. The graph in Figure 5.9c is that of the 
complete solution (5.66). It is clear from this that the effect of the transient term soon 
becomes negligible, and that after a short time the contribution of the steady-state term 
is essentially all that remains. 


Exercises 

1. A 61b weight is attached to the lower end of a coil spring suspended from 
the ceiling, the spring constant of the spring being 27 lb/ft. The weight comes to 
rest in its equilibrium position, and beginning at t = 0 an external force given by 
F(t) = 12 cos 20 1 is applied to the system. Determine the resulting displacement 
as a function of the time, assuming damping is negligible. 

2. A 16-lb weight is attached to the lower end of a coil spring suspended from the 
ceiling. The weight comes to rest in its equilibrium position, thereby stretching the 
spring 0.4 ft. Then, beginning at t = 0, an external force given by F(t) = 40 cos 1 6 1 
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is applied to the system. The medium offers a resistance in pounds numerically 
equal to 4 (dx/dt), where dx/dt is the instantaneous velocity in feet per second. 

(a) Find the displacement of the weight as a function of the time. 

(b) Graph separately the transient and steady-state terms of the motion found in 
step (a) and then use the curves so obtained to graph the entire displacement 
itself. 

3. A 10-lb weight is hung on the lower end of a coil spring suspended from the 
ceiling, the spring constant of the spring being 20 lb/ft. The weight comes to rest 
in its equilibrium position, and beginning at t = 0 an external force given by 
F(t) = 10 cos 8 1 is applied to the system. The medium offers a resistance in 
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pounds numerically equal to 5 (dx/dt), where dx/dt is the instantaneous velocity in 
feet per second. Find the displacement of the weight as a function of the time. 

4. A 4-lb weight is hung on the lower end of a coil spring suspended from a beam. 
The weight comes to rest in its equilibrium position, thereby stretching the spring 
3 in. The weight is then pulled down 6 in. below this position and released at t = 0. 
At this instant an external force given by F(t) = 13 sin 4r is applied to the system. 
The resistance of the medium in pounds is numerically equal to twice the 
instantaneous velocity, measured in feet per second. 

(a) Find the displacement of the weight as a function of the time. 

(b) Observe that the displacement is the sum of a transient term and a steady- 
state term, and find the amplitude of the steady-state term. 


5. A 6-lb weight is hung on the lower end of a coil spring suspended from the ceiling. 
The weight comes to rest in its equilibrium position, thereby stretching the 
spring 4 in. Then beginning at t = 0 an external force given by F(t) = 27 
sin 4r — 3 cos At is applied to the system. If the medium offers a resistance in 
pounds numerically equal to three times the instantaneous velocity, measured 
in feet per second, find the displacement as a function of the time. 

6. A certain coil spring having spring constant 10 lb/ft is suspended from the ceiling. 
A 32-lb weight is attached to the lower end of the spring and comes to rest in 
its equilibrium position. Beginning at t = 0 an external force given by F(t) = 
sin t + 1 sin 2t + ^ sin 3t is applied to the system. The medium offers a resist- 
ance in pounds numerically equal to twice the instantaneous velocity, measured 
in feet per second. Find the displacement of the weight as a function of the time, 
using Chapter 4, Theorem 4.10 to obtain the steady-state term. 


7. A coil spring having spring constant 20 lb/ft is suspended from the ceiling. A 32-lb 
weight is attached to the lower end of the spring and comes to rest in its 
equilibrium position. Beginning at t = 0 an external force given by F(t) = 
40 cos It is applied to the system. This force then remains in effect until t = n, 
at which instant it ceases to be applied. For t > n, no external forces are present. 
The medium offers a resistance in pounds numerically equal to A(dx/dt), where 
dx/dt is the instantaneous velocity in feet per second. Find the displacement of the 
weight as a function of the time for all t > 0. 


8 . 


Consider the basic differential equation (5.7) for the motion of a mass m vibrating 
up and down on a coil spring suspended vertically from a fixed support; and 
suppose the external impressed force F is the periodic function defined by 
F(t) = F 2 sin cot for all t > 0, where F 2 and co are constants. Show that in this case 
the steady-state term in the solution of Equation (5.7) may be written 


y/(/, 2 - oo 2 ) 2 + Ab 2 co : 


: sin (cot —.0), 


where b = a/2m , / 2 = k/m , E 2 = F 2 /m , and 6 is determined from Equations 
(5.55). 


9. A 32-lb weight is attached to the lower end of a coil spring suspended vertically 
from a fixed support and comes to rest in its equilibrium position, thereby 
stretching the spring 6 in. Beginning at t = 0 an external force given by F(t) = 
15 cos It is applied to the system. Assume that the damping is negligible. 
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(a) Find the displacement of the weight as a function of the time. 

(b) Show that this displacement may be expressed as x = A(t)sin(\5t/2), where 
A(t) = 2 sin \t. The function A(t) may be regarded as the “slowly varying” 
amplitude of the more rapid oscillation sin(15r/2). When a phenomenon 
involving such fluctuations in maximum amplitude takes place in acoustical 
applications, beats are said to occur. 

(c) Carefully graph the slowly varying amplitude A(t) = 2 sin(t/2) and its 
negative -A(t) and then use these “bounding curves” to graph the 
displacement x = A(t)sin(\5t/2). 

10. A 16-lb weight is attached to the lower end of a coil spring that is suspended 
vertically from a support and for which the spring constant k is 10 lb/ft. The 
weight comes to rest in its equilibrium position and is then pulled down 6 in. 
below this position and released at t = 0. At this instant the support of the spring 
begins a vertical oscillation such that its distance from its initial position is given 
by j sin 2 1 for t > 0. The resistance of the medium in pounds is numerically equal 
to 2 (dx/dt), where dx/dt is the instantaneous velocity of the moving weight in feet 
per second. 

(a) Show that the differential equation for the displacement of the weight from 
its equilibrium position is 

1 d 2 x . ^dx 1 . . 

- = — 10(x — y) — 2 — , where v = - sin 2 t, 

2 dt 2 x y) dt y 2 

and hence that this differential equation may be written 

d 2 x A dx _ _ . ^ 

77 + 4 — + 20x = 10 sin 2 1. 
dt 2 dt 

(b) Solve the differential equation obtained in step (a), apply the relevant initial 
conditions, and thus obtain the displacement x as a function of time. 


5.5 RESONANCE PHENOMENA 


We now consider the amplitude of the steady-state vibration that results from the 
periodic external force defined for all t by F(t) = F t cos cot , where we assume that 
F t > 0. For fixed b , 2, and E t we see from Equation (5.56) that this is the function / 
of co defined by 


/M = 


^/(A 2 - co 2 ) 2 + 4 b 2 co 2 


(5.67) 


If co = 0, the force F(t) is the constant F l and the amplitude / (co) has the value 
E l /X 2 > 0. Also, as co — > oo, we see from (5.67) that /(co)->0. Let us consider the 
function / for 0 < co < oo. Calculating the derivative f'(co) we find that this deriva- 
tive equals zero only if 

4co[2b 2 - (2 2 - co 2 )] = 0 

and hence only if co = 0 or co = ^JX 2 - 2b 2 . If X 2 < 2b 2 , y JX 2 — 2 b 2 is a complex 
number. Hence in this case / has no extremum for 0 < co < oo, but rather / decreases 
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monotonically for 0 < co < oo from the value E i /X 2 at co = 0 and approaches zero as 
co -► oo. Let us as sume that X 2 > 2b 2 . Then the function / has a relative maximum at 
co i = yjX 2 — 2b 2 , and this maximum value is given by 

E , 


/(© i) = 




y/(2b 2 ) 2 + 4fo 2 (A 2 - 26 2 ) 2b y JX 2 - b 2 


When the frequency of the forcing function cos cot is such that 60 = 0 ;!, then the 
forcing function is said to be in resonance with the system. In other words, the forcing 
function defined by F t cos cot is in resonance with the system when co assumes the value 
co l at which f(co) is a maximum. The value co i /2n is called the resonance frequency of 
the system. Note carefully that resonance can occur only if X 2 > 2b 2 . Since then 
X 2 > b 2 , the damping must be less than critical in such a case. 

We now return to the original notation of Equation (5.50). In terms of the quantities 
m, a , k , and F t of that equation, the function / is given by 


/M = 


Ft 

m 




(5.68) 


In this original notation the resonance frequency is 


2n 


r- 

m 


a 

2 m 2 ' 


(5.69) 


Since the frequency of the corresponding free, damped oscillation is 


2n 


a 

4 m 2 ’ 


we see that the resonance frequency is less than that of the corresponding free, damped 
oscillation. 

The graph of /(co) is called the resonance curve of the system. For a given system with 
fixed m, k, and F u there is a resonance curve corresponding to each value of the 
damping coefficient a > 0. Let us choose m = k = F i = 1, for example, and graph the 
resonance curves corresponding to certain selected values of a. In this case we have 


J T T 

^/(l — co 2 ) 2 + a 2 co 2 

and the resonance frequency is given by (l/2n)y/l - a 2 / 2. The graphs appear in 
Figure 5.10. 

Observe that resonance occurs in this case only if a < J2. As a decreases from yjl to 
0, the value co l at which resonance occurs increases from 0 to 1 and the corresponding 
maximum value of f(co) becomes larger and larger. In the limiting case a = 0, the 
maximum has disappeared and an infinite discontinuity occurs at co = 1. In this case 
our solution actually breaks down, for then 


m = 


i 

x /( l -“ 2 ) 2 


1 


1 - CO 2 
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and /( 1) is undefined. This limiting case is an example of undamped resonance , a 
phenomenon that we shall now investigate. 

Undamped resonance occurs when there is no damping #nd the frequency of the 
impressed force is equal to the natural frequency of the system. Si nce i n this case a = 0 
and the frequency co/27i equals the natural frequency (1 /2it)y/k/m, the differential 
equation (5.50) reduces to 


m 


d 2 x 


+ kx = F t cos — t 


m 


or 


d 2 x k 
dt ~ m 


it ^ rv / ' v , 

-TT + — X = £, COS /— t, 


It 

m 


where E t = FJm. Since the complementary function of Equation (5.70) is 

• ft ft 

x c = c t sin /— t + c 2 cos ^ /— U 


m 


m 


(5.70) 


(5.71) 


we cannot assume a particular integral of the form 



A sin 
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Rather we must assume 


x 


p 


= At sin 


t + Bt cos 


m 



Differentiating this twice and substituting into Equation (5.70), we find that 


A 



B = 0. 


Thus th e par ticular integral of Equation (5.70) resulting from the forcing function 
E t cos yfkjmt is given by 


x 


p 



Expressing the complementary function (5.7 1) in the equivalent “phase-angle” form, we 
see that the general solution of Equation (5.70) is given by 


x = c cos 



t. 


(5.72) 


The motion defined by (5. 72) is thus the sum of a periodic term and an oscillatory term 
whose amplitude (E i /2) y /mjkt increases with t. The graph of the function defined by 
this latter term, 


Ei 

2 



appears in Figure 5.11. As t increases, this term clearly dominates the entire motion. 
One might argue that Equation (5.72) informs us that as t -+ oo the oscillations will 
become infinite. However, common sense intervenes and convinces us that before this 
exciting phenomenon can occur the system will break down and then Equation (5.72) 
will no longer apply! 



Figure 5.11 
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► Example 5.6 

A 64-lb weight is attached to the lower end of a coil spring suspended from the ceiling, 
the spring constant being 18 lb/ft. The weight comes to rest in its equilibrium position. 
It is then pulled down 6 in. below its equilibrium position and released at t = 0. At this 
instant an external force given by F(t) = 3 cos cot is applied to the system. 

1. Assuming the damping force in pounds is numerically equal to 4 (dx/dt), where 
dx/dt is the instantaneous velocity in feet per second, determine the resonance 
frequency of the resulting motion. 

2. Assuming there is no damping, determine the value of co that gives rise to 
undamped resonance. 


Solution. Since m = w/g = ff = 2 (slugs), k = 18, and F(t) = 3 cos, cot, the dif- 
ferential equation is 


d^x dx 

2 — -7T + a — + 1 8x = 3 cos cot , 
dt 2 dt 


where a is the damping coefficient. In Part 1, a = 4 and so in this case the differential 
equation reduces to 


- d 2 x dx 

2 — y- + 4 — + 1 8x = 3 cos cot. 
dt 2 dt 


Here we are not asked to solve the differential equation but merely to determine the 
resonance frequency. Using formula (5.69) we find that this is 

1 /l8 1 / 16 \ 1 r- ^ , , , 

S -Jj “ 2 (t j - S ' 'P * 042 (cyc ' es/sec) ' 

Thus resonance occurs when co = Jl « 2.65. 

In Part 2, a = 0 and so the differential equation reduces to 


dt 2 


+ 9x = | cos cot. 


(5.73) 


Undamped resonance occurs when the frequency oo/ln of the impressed force is equal 
to the natural frequency. The complementary function of Equation (5.73) is 

x c = Cj sin 3 1 + c 2 cos 3 1 , 

and from this we see that the natural frequency is 3/2n. Thus co = 3 gives rise to 
undamped resonance and the differential equation (5.73) in this case becomes 

+ 9x = | cos 3 1. (5.74) 

The initial conditions are x(0) = j, x'(0) = 0. The reader should show that the solution 
of Equation (5.74) satisfying these conditions is 

x = ^-cos 3 1 + it sin 3 1. 
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Exercises 

1. A 12-lb weight is attached to the lower end of a coil spring suspended from the 
ceiling. The weight comes to rest in its equilibrium position thereby stretching the 
spring 6 in. Beginning at t = 0 an external force given by F(t) = 2 cos cot is applied 
to the system. 

(a) If the damping force in pounds is numerically equal to 3 (dx/dt), where dx/dt is 
the instantaneous velocity in feet per second, determine the resonance 
frequency of the resulting motion and find the displacement as a function of 
the time when the forcing function is in resonance with the system. 

(b) Assuming there is no damping, determine the value of co that gives rise to 
undamped resonance and find the displacement as a function of the time in 
this case. 

2. A 20-lb weight is attached to the lower end of a coil spring suspended from the 
ceiling. The weight comes to rest in its equilibrium position, thereby stretching the 
spring 6 in. Various external forces of the form F(t) = cos cot are applied to the 
system and it is found that the resonance frequency is 0.5 cycles/sec. Assuming that 
the resistance of the medium in pounds is numerically equal to a(dx/dt ), where 
dx/dt is the instantaneous velocity in feet per second, determine the damping 
coefficient a. 

3. The differential equation for the motion of a unit mass on a certain coil spring 
under the action of an external force of the form F(t) = 30 cos cot is 

^r-r- + a ^ + 24x = 30 cos cot, 
dt 2 dt 

where a > 0 is the damping coefficient. 

(a) Graph the resonance curves of the system for a = 0, 2, 4, 6, and 4y/3. 

(b) If a = 4, find the resonance frequency and determine the amplitude of the 
steady-state vibration when the forcing function is in resonance with the 
system. 

(c) Proceed as in part (b) if a — 2. 

5.6 ELECTRIC CIRCUIT PROBLEMS 

In this section we consider the application of differential equations to series circuits 
containing (1) an electromotive force, and (2) resistors, inductors, and capacitors. We 
assume that the reader is somewhat familiar with these items and so we shall avoid an 
extensive discussion. Let us simply recall that the electromotive force (for example, a 
battery or generator) produces a flow of current in a closed circuit and that this current 
produces a so-called voltage drop across each resistor, inductor, and capacitor. Further, 
the following three laws concerning the voltage drops across these various elements are 
known to hold: 

1. The voltage drop across a resistor is given by 

E r = Ri, (5.75) 

where R is a constant of proportionality called the resistance, and i is the current. 
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2. The voltage drop across an inductor is given by 


El 



(5.76) 


where L is a constant of proportionality called the inductance , and i again denotes the 
current. 

3. The voltage drop across a capacitor is given by 



(5.77) 


where C is a constant of proportionality called the capacitance and q is the 
instantaneous charge on the capacitor. Since i = dq/dt , this is often written as 



The units in common use are listed in Table 5.1. 


TABLE 5.1 


Quantity and symbol 

Unit 

emf or voltage E 

volt (V) 

current i 

ampere 

charge q 

coulomb 

resistance R 

ohm (Q) 

inductance L 

henry (H) 

capacitance C 

farad 


The fundamental law in the study of electric circuits is the following: 

Kirchhoff's Voltage Law (Form 1). The algebraic sum of the instantaneous voltage 
drops around a close circuit in a specific direction is zero. 

Since voltage drops across resistors, inductors, and capacitors have the opposite sign 
from voltages arising from electromotive forces, we may state this law in the following 
alternative form: 

Kirchhoff's Voltage Law (Form 2). The sum of the voltage drops across resistors, 
inductors, and capacitors is equal to the total electromotive force in a closed circuit. 

We now consider the circuit shown in Figure 5.12. 



Figure 5.12 




5.6 ELECTRIC CIRCUIT PROBLEMS 213 


Here and in later diagrams the following conventional symbols are employed: 



— \Wv — R 


-vj IflftftSb” L 

— II — c 


Electromotive force (battery or generator) 

Resistor 

Inductor 

Capacitor 


Let us apply Kirchhoff’s law to the circuit of Figure 5.12. Letting E denote the 
electromotive force, and using the laws 1, 2, and 3 for voltage drops that were given 
above, we are led at once to the equation 

L— + Ri + — q = E. (5.78) 


This equation contains two dependent variables i and q. However, we recall that these 
two variables are related to each other by the equation 



(5.79) 


Using this we may eliminate i from Equation (5.78) and write it in the form 

, d 2 q , n dq _ 1 


dt 2 


* R f t + z q ~ E - 


(5.80) 


Equation (5.80) is a second-order linear differential equation in the single dependent 
variable q. On the other hand, if we differentiate Equation (5.78) with respect to t and 
make use of (5.79), we may eliminate q from Equation (5.78) and write 


_ d 2 i di 1 dE 

L ~t~2 + R ~t~ T ~p, 1 = ~7~ * 
dt 1 dt C dt 


(5.81) 


This is a second-order linear differential equation in the single dependent variable i. 

Thus we have the two second-order linear differential equations (5.80) and (5.81) for 
the charge q and current i , respectively. Further observe that in two very simple cases 
the problem reduces to a first - order linear differential equation. If the circuit contains 
no capacitor, Equation (5.78) itself reduces directly to 

r di 

L — — f- Ri = E; 
dt 


while if no inductor is present, Equation (5.80) reduces to 




q = E. 


Before considering examples, we observe an interesting and useful analogy. The 
differential equation (5.80) for the charge is exactly the same as the differential equation 
(5.7) of Section 5.1 for the vibrations of a mass on a coil spring, except for the notations 
used. That is, the electrical system described by Equation (5.80) is analogous to the 
mechanical system described by Equation (5.7) of Section 5.1. This analogy is brought 
out by Table 5.2. 
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TABLE 5.2 

Mechanical system 

Electrical system 

mass m 

damping constant a 
spring constant k 
impressed force F(t) 
displacement x 
velocity v = dx/dt 

inductance L 
resistance R 

reciprocal of capacitance 1/C 
impressed voltage or emf E 
charge q 

current i = dq/dt 


► Example 5.7 

A circuit has in series an electromotive force given by E = 100 sin 40 1 V, a resistor of 
10 Q and an inductor of 0.5 H. If the initial current is 0, find the curreat at time t > 0. 

Formulation. The circuit diagram is shown in Figure 5.13. Let i denote the current 
in amperes at time t. The total electromotive force is 100 sin 40 1 V. Using the laws 1 and 
2, we find that the voltage drops are as follows: 

1. Across the resistor: E R = Ri = lOi. 

„ , . , „ , di 1 di 

2. Across the inductor: E L = L— = - — . 

dt 2 dt 

Applying Kirchhoff ’s law, have the differential equation 

~+ 10i = 100 sin 40f, 

2 dt 


or 

$ + 201 = 200 sin 40 1. 

dt 


Since the initial current is 0, the initial condition is 

m = o. 


(5.82) 


(5.83) 


Solution. Equation (5.82) is a first-order linear equation. An integrating factor is 

e S 20 it _ g 20t 


R 
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Multiplying (5.82) by this, we obtain 

e 20t % + 20 e 20, i = 200e 2O < sin 40t 
dt 

or 

-7- (e 20t i) = 200e 2Ot sin 40 1. 
dt 

Integrating and simplifying, we find 

i = 2(sin 40r - 2 cos 40 1) + Ce~ 20t . 

Applying the condition (5.83), i = 0 when t = 0, we find C = 4. Thus the solution is 
i = 2(sin 40 1 — 2 cos 40 1) -f 4e~ 20t . 

Expressing the trigonometric terms in a “phase-angle” form, we have 


1 2 

i = 2 x /5( —jz sin 40t -= cos 40 1 ) + 4e~ 10t 

W 5 V 5 


or 


i = 2^/5 sin(40f + 0) + 4e~ 
where </) is determined by the equation 


20t 


(5.84) 


0 


1 


= arccos —7= = arcsin 

75 



We find cj) « — 1.1 1 rad, and hence the current is given approximately by 

i — 4.47 sin(40f — 1.1 1) + 4e~ 20t . 


Interpretation. The current is clearly expressed as the sum of a sinusoidal term and 
an exponential. The exponential becomes so very small in a short time that its effect is 
soon practically negligible; it is the transient term. Thus, after a short time, essentially 
all that remains is the sinusoidal term; it is the steady-state current. Observe that its 
period n/20 is the same as that of the electromotive force. However, the phase angle 
(j) % —1.11 indicates that the electromotive force leads the steady-state current by 
approximately ^ (1.11). The graph of the current as a function of time appears in 
Figure 5.14. 



Figure 5.14 
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► Example 5.8 

A circuit has in series an electromotive force given by E = 100 sin 60 1 V, a resistor of 
2 Q, an inductor of 0.1 H, and a capacitor of jho farads. (See Figure 5.15.) If the initial 
current and the initial charge on the capacitor are both zero, find the charge on the 
capacitor at any time t > 0. 

Formulation 1, by directly applying Kirchhoff's law: Let i denote the current 
and q the charge on the capacitor at time t. The total electromotive force is 
100 sin 60t (volts). Using the voltage drop laws 1, 2, and 3 we find that the voltage drops 
are as follows: 

1. Across the resistor: E R = Ri = 2 i. 

^ „ , di 1 di 

2. Across the inductor: E L = L — = — — . 

L dt 10 dt 

3. Across the capacitor: E c = = 260 q. 

Now applying Kirchhoff’s law we have at once: 

~r ~~ T 2i ~h 260 q = 100 sin 60 1. 

10 dt 

Since i = dq/dt , this reduces to 

1 d 2 q dq 

— -jt + 2 -f + 260 q = 100 sin 60t. (5.85) 

10 dt 2 dt v ' 


Formulation 2, applying Equation (5.80) for the charge: We have L = = 2, 

C = 2 ^ 0 , E = 100 sin 60r. Substituting these values directly into Equation (5.80) we 
again obtain Equation (5.85) at once. 

Multiplying Equation (5.85) through by 10, we consider the differential equation in 
the form 


-jt + 20 + 2600qr = 1000 sin 60r. 

dt 2 dt 


(5.86) 


R = 2 ohms 



C = 260 ^ f ara d s 


Figure 5.15 
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Since the charge q is initially zero, we have as a first initial condition 

q( 0 ) = 0 . (5.87) 

Since the current i is also initially zero and i = dq/dt , we take the second initial 
condition in the form 

q'( 0) = 0. (5.88) 


Solution. The homogeneous equation corresponding to (5.86) has the auxiliary 
equation 


r 2 + 20 r + 2600 = 0. 


The roots of this equation are — 10 ± 50 i and so the complementary function of 
Equation (5.86) is 

q c = e~ 10t (C l sin 50r + C 2 cos 50r). 

Employing the method of undetermined coefficients to find a particular integral of 
(5.86), we write 

q p = A sin 60 1 + B cos 60 1 . 

Differentiating twice and substituting into Equation (5.86) we find that 

A = and B = -ff, 

and so the general solution of Equation (5.86) is 

q = e~ l0t (C l sin 50 1 + C 2 cos 50r) - fy sin 60 1 - fy cos 60 1. (5.89) 

Differentiating (5.89), we obtain 

$ = e~ 10 '[(— lOCt - 50C 2 )sin 50r + (50C, - 10C 2 )cos 50f] 
dt 

cos 60 1 + ^ sin 60 1. (5.90) 

Applying condition (5.87) to Equation (5.89) and condition (5.88) to Equation (5.90), we 
have 


C 2 - t? = 0 and 50^ - 10C 2 - ^ = 0. 
From these equations, we find that 


Ci — 6i and C 2 — 


30 
61 • 


Thus the solution of the problem is 

6e~ 10t . 5 

q = — — — (6 sin 50r + 5 cos 50r) — — (5 sin 60 1 + 6 cos 60 1) 


61 


61 


or 




10 1 


61 


cos(50t — </>) — 


5^61 

61 


cos(60t — 0), 
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where cos 0 = 5/^/6?, sin 4> = 6/^/6! and COS e = 6/ n / 61, sin 6 = 5/^/6 1. From 
these equations we determine </> % 0.88 (radians) and 9 % 0.69 (radians). Thus our 
solution is given approximately by 

q = 0.77e~ 1Ot cos(50 1 - 0.88) - 0.64 cos(60t - 0.69). 

Interpretation. The first term in the above solution clearly becomes negligible after 
a relatively short time; it is the transient term. After a sufficient time essentially all 
that remains is the periodic second term; this is the steady-state term . The graphs of 
these two components and that of their sum (the complete solution) are shown in 
Figure 5.16. 





Figure 5.16 
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Exercises 


1. A circuit has in series a constant electromotive force of 40 V, a resistor of 10 Q, and 
an inductor of 0.2 H. If the initial current is 0, find the current at time t > 0. 

2. Solve Exercise 1 if the electromotive force is given by E(t) = 150 cos 200t V instead 
of the constant electromotive force given in that problem. 

3. A circuit has in series a constant electromotive force of 100 V, a resistor of 10 Q, 
and a capacitor of 2 x 10“ 4 farads. The switch is closed at time t = 0, and the 
charge on the capacitor at this instant is zero. Find the charge and the current at 
time t > 0. 

4. A circuit has in series an electromotive force given by E(t) — 5 sin 1 OOf V, a resistor 
of 10 Q, an inductor of 0.05 H, and a capacitor of 2 x 10“ 4 farads. If the initial 
current and the initial charge on the capacitor are both zero, find the charge on the 
capacitor at any time t > 0. 

5. A circuit has in series an electromotive force given by E(t) = 100 sin 200r V, a 
resistor of 40 Q, an inductor of 0.25 H, and a capacitor of 4 x 10“ 4 farads. If the 
initial current is zero, and the initial charge on the capacitor is 0.01 coulombs, find 
the current at any time t > 0. 

6. A circuit has in series an electromotive force given by E(t ) = 200e “ 1 00t V, a resistor 
of 80 Q, an inductor of 0.2 H, and a capacitor of 5 x 10“ 6 farads. If the initial 
current and the initial charge on the capacitor are zero, find the current at any time 
t > 0. 


7. A circuit has in series a resistor R Q, and inductor of L H , and a capacitor of 
C farads. The initial current is zero and the initial charge on the capacitor is Q 0 
coulombs. 

(a) Show that the charge a nd th e current are damped oscillatory functions of time 
if and only if JR < 2^/L/C, and find the expressions for the charge and the 
current in this case. 

(b) If R > 2y/ L/C, discuss the nature of the charge and the current as functions 
of time. 


8 . 


A circuit has in series an electromotive force given by E(t) = E 0 sin cot V, a resistor 
of R Q, an inductor of L H , and a capacitor of C farads. 

(a) Show that the steady-state current is 



R 


: sin cot 


cos cot 


where X = Leo — 1 /Cco and Z = Jx 2 + R 2 . The quantity X is called the 
reactance of the circuit and Z is called the impedance . 

(b) Using the result of part (a) show that the steady-state current may be written 


i = -j- sin (cot - </>), 


where 0 is determined by the equations 


, R . , * 

cos ^ = z’ sln ^ = z 
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Thus show that the steady-state current attains its maximum absolute value 
£ 0 /Z at times t n + 0/co, where 


t 


n 


1 

CO 


(2 n - l)7t 
2 


(«= 1,2, 3,...), 


are the times at which the electromotive force attains its maximum absolute 
value E 0 . 

(c) Show that the amplitude of the steady-state current is a maximum when 


1 

oo = —=. 

Jlc 

For this value of co electrical resonance is said to occur. 

(d) If R = 20, L = C = 10“ 4 , and E 0 = 100, find the value of co that gives rise 
to electrical resonance and determine the amplitude of the steady-state 
current in this case. 



-CHAPTER SIX= 

Series Solutions of Linear Differential Equations 


In Chapter 4 we learned that certain types of higher-order linear differential equations 
(for example, those with constant coefficients) have solutions that can be expressed as 
finite linear combinations of known elementary functions. In general, however, higher- 
order linear equations have no solutions that can be expressed in such a simple manner. 
Thus we must seek other means of expression for the solutions of these equations. One 
such means of expression is furnished by infinite series representations, and the present 
chapter is devoted to methods of obtaining solutions in infinite series form. 


6.1 POWER SERIES SOLUTIONS ABOUT AN ORDINARY POINT 
A. Basic Concepts and Results 

Consider the second-order homogeneous linear differential equation 

d 2 v dv 

a °^dx^ + a 2 (x)y = 0, (6.1) 

and suppose that this equation has no solution that is expressible as a finite linear 
combination of known elementary functions. Let us assume, however, that it does have 
a solution that can be expressed in the form of an infinite series. Specifically, we assume 
that it has a solution expressible in the form 

00 

c 0 + c t (x - x 0 ) + c 2 (x - x 0 ) 2 + ■■■ = £ c„(x - x 0 ) n , (6.2) 

n = 0 

where c 0 , c l9 c 2 , ... are constants. An expression of the form (6.2) is called a power 
series in x — x 0 . We have thus assumed that the differential equation (6.1) has a so- 
called power series solution of the form (6.2). Assuming that this assumption is valid, we 
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can proceed to determine the coefficients c 0 , c l9 c 2 , . . . in (6.2) in such a manner that the 
expression (6.2) does indeed satisfy the Equation (6.1). 

But under what conditions is this assumption actually valid? That is, under what 
conditions can we be certain that the differential equation (6.1) actually does have a 
solution of the form (6.2)? This is a question of considerable importance; for it would be 
quite absurd to actually try to find a “solution” of the form (6.2) if there were really no 
such solution to be found! In order to answer this important question concerning the 
existence of a solution of the form (6.2), we shall first introduce certain basic definitions. 
For this purpose let us write the differential equation (6.1) in the equivalent normalized 
form 


where 


d 2 

dx 


4 + J» 1 (x)g + P 2 M> = °, 


(6.3) 


Pi(x) = 


a oM 


and 


P 2 (x) = 




DEFINITION 


A function f is said to be analytic at x 0 if its Taylor series about x 0 , 


f f {n) (* o) 

rko It! 


(x - x 0 r, 


exists and converges to f(x) for all x in some open interval including x 0 . 


We note that all polynomial functions are analytic everywhere; so also are the 
functions with values e x , sin x, and cos x. A rational function is analytic except at those 
values of x at which its denominator is zero. For example, the rational function defined 
by l/(x 2 — 3x -f 2) is analytic except at x = 1 and x = 2. 


DEFINITION 

The point x 0 is called an ordinary point of the differential equation (6.1) if both of the 
functions P t and P 2 in the equivalent normalized equation (6.3) are analytic at x 0 . If either 
(or both) of these functions is not anaytic at x 0 , then x 0 is called a singular point of the 
differential equation (6.1). 


► Example 6.1 

Consider the differential equation 

lx^ + x % + {x2 + 2) 3' = 0 - (M 

Here P x (x) = x and P 2 (x) = x 2 + 2. Both of the functions P t and P 2 are polynomial 
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functions and so they are analytic everywhere. Thus all points are ordinary points of 
this differential equation. 


► Example 6.2 


Consider the differential equation 


(x 


d 2 y „ 


dy_ 

dx 


(6.5) 


Equation (6.5) has not been written in the normalized form (6.3). We must first express 
(6.5) in the normalized form, thereby obtaining 


Here 


d 2 y t x dy 
dx 2 x — 1 dx 


1 


x(x — 1) 


y = 0. 


P\( x ) = — ~r and P 2 (x) = - — ~ l 

x — 1 x(x — 1) 

The function P t is analytic, except at x = 1 , and P 2 is analytic except at x = 0 and x = 1 . 
Thus x = 0 and x = 1 are singular points of the differential equation under 
consideration. All other points are ordinary points. Note clearly that x = 0 is a singular 
point, even though P t is analytic at x = 0. We mention this fact to emphasize that both 
P 1 and P 2 must be analytic at x 0 in order for x 0 to be an ordinary point. 

We are now in a position to state a theorem concerning the existence of power series 
solutions of the form (6.2). 


THEOREM 6.1 

Hypothesis. The point x 0 is an ordinary point of the differential equation {6.1). 

Conclusion. The differential equation ( 6.1 ) has two nontrivial linearly independent 
power series solutions of the form 

Z c n(x - X 0 ) n , (6.2) 

11 = 0 

and these power series converge in some interval |x — x 0 | < R {where R > 0) about x 0 . 

This theorem gives us a sufficient condition for the existence of power series 
solutions of the differential equation (6.1). It states that if x 0 is an ordinary point of 
equation (6.1), then this equation has two power series solutions in powers of x - x 0 
and that these two power series solutions are linearly independent. Thus if x 0 is an 
ordinary point of (6.1), we may obtain the general solution of (6.1) as a linear 
combination of these two linearly independent power series. We shall omit the proof of 
this important theorem. 
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► Example 6.3 

In Example 6.1 we noted that all points are ordinary points of the differential equa- 
tion (6.4). Thus this differential equation has two linearly independent solutions of the 
form (6.2) about any point x 0 . 


► Example 6.4 

In Example 6.2 we observed that x = 0 and x = 1 are the only singular points of the 
differential equation (6.5). Thus this differential equation has two linearly independent 
solutions of the form (6.2) about any point x 0 # 0 or 1. For example, the equation has 
two linearly independent solutions of the form 

00 

I C„(x - 2)" 

n = 0 

about the ordinary point 2. However, we are not assured that there exists any solution 
of the form 


about the singular point 0 or any solution of the form 

Z C n( X - 1)" 

n = 0 

about the singular point 1. 


B. The Method of Solution 

Now that we are assured that under appropriate hypotheses Equation (6.1) actually 
does have power series solutions of the form (6.2), how do we proceed to find these 
solutions? In other words, how do we determine the coefficients c 0 , c t , c 2 , ... in the 
expression 


Z C„(x - X 0 )" (6.2) 

M = 0 

so that this expression actually does satisfy Equation (6.1)? We shall first give a brief 
outline of the procedure for finding these coefficients and shall then illustrate the 
procedure in detail by considering specific examples. 

Assuming that x 0 is an ordinary point of the differential equation (6.1), so that 
solutions in powers of x — x 0 actually do exist, we denote such a solution by 

00 

y = c 0 + Ci(x - x 0 ) + c 2 (x - x 0 ) 2 + ■■■ = Z c n (x - x 0 ) n . (6.6) 

n = 0 

Since the series in (6.6) converges on an interval |x — x 0 | < R about x 0 , it may be 
differentiated term by term on this interval twice in succession to obtain 

dy 00 

— = Ci+ 2 c 2 (x - x 0 ) + 3c 3 (x - x 0 ) 2 + • • • = Z nc n( x - x o)"~ l 

dx n= 1 


(6.7) 
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and 

d ^ V ^ 

t 4 = 2 c 2 + 6 c 3 (x - x 0 ) + 12 c 4 (x - x 0 ) 2 + • • • = £ n(n - l)c„(x - x 0 )"~ 2 , 

ax n = 2 

(6.8) 

respectively. We now substitute the series in the right members of (6.6), (6.7), and (6.8) 
for y and its first two derivatives, respectively, in the differential equation (6.1). We then 
simplify the resulting expression so that it takes the form 

Ko + K t (x- x 0 ) + K 2 (x - x 0 ) 2 + • • • = 0, (6.9) 

where the coefficients K t (i = 0,1,2,...) are functions of certain coefficients c n of 
the solution (6.6). In order that (6.9) be valid for all x in the interval of convergence 
|x - x 0 | < JR, we must set 

K 0 = K y = K 2 =' = 0. 

In other words, we must equate to zero the coefficient of each power of x — x 0 in the left 
member of (6.9). This leads to a set of conditions that must be satisfied by the various 
coefficients c n in the series (6.6) in order that (6.6) be a solution of the differential 
equation (6.1). If the c„ are chosen to satisfy the set of conditions that thus occurs, then 
the resulting series (6.6) is the desired solution of the differential equation (6.1). We shall 
illustrate this procedure in detail in the two examples which follow. 


► Example 6.5 

Find the power series solution of the differential equation 

U' +x ^ + ^ +2), -° (64) 

in powers of x (that is, about x 0 = 0). 

Solution. We have already observed that x 0 = 0 is an ordinary point of the 
differential equation (6.4) and that two linearly independent solutions of the desired 
type actually exist. Our procedure will yield both of these solutions at once. 

We thus assume a solution of the form (6.6) with x 0 = 0. That is, we assume 

y = I c„x". (6.10) 

n = 0 

Differentiating term by term we obtain 

dv 

— = £ nc n x"~ l (6.11) 

ax n= 1 

and 


d 2 y 

d: J 


n = 2 


n(n — l)c„x n 2 . 


(6.12) 


Substituting the series (6.10), (6.11), and (6.12) into the differential equation (6.4), we 
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obtain 


£ n(n - 1 )c n x n 2 + x £ nc n x n 1 + x I 2 £ c„x n + 2 £ c„x" = 0. 

« = 2 n=l m = 0 « = 0 

Since x is independent of the index of summation n, we may rewrite this as 


E «(« — l)c„x" 2 + £ nc„x" + £ c„x n+2 + 2 £ c n x " = 0. (6.13) 

« — 2 n = 1 n = 0 n = 0 

In order to put the left member of Equation (6.13) in the form (6.9), we shall rewrite 
the first and third summations in (6.13) so that x in each of these summations will have 
the exponent n. Let us consider the first summation 


£ n(n - 1 )c„x n 2 (6.14) 

n = 2 

in (6.13). To rewrite the summation (6.14) so that x will have the desired exponent n , we 
first replace the present exponent n — 2 in (6.14) by a new variable m. That is, we let 
m = n — 2 in (6.14). Then n = m + 2, and since m = 0 for n = 2, the summation (6.14) 
takes the form 


£ (m + 2 )(m + l)c m+2 x m . (6.15) 

m = 0 

Now since the variable of summation is merely a “dummy” variable, we may replace m 
by n in (6.15) to write the first summation in (6.13) as 

£ (n + 2)(n + l)c„ +2 x". (6.16) 

11 = 0 

In like manner, letting m = n + 2, the third summation 


I c„x" +2 (6.17) 

« = 0 

in (6.13) first takes the form 


00 


I C m _ 2 x" (6.18) 

m = 2 

Then replacing m by n in (6.18), the third summation in (6.13) may be written as 


c„-?x" 


n — 2 


(6.19) 


Thus replacing (6.14) by its equivalent (6.16) and (6.17) by its equivalent (6.19), 
Equation (6.13) may be written 


£ (« + 2)(n + l)c„ +2 x" + 

n = 0 


oo 

E 


nc„x n + X c„_ 2 x" 

n = 2 


+ 2 E c„x" = 0. 

n = 0 


( 6 . 20 ) 


Although x has the same exponent n in each summation in (6.20), the ranges of the 
various summations are not all the same. In the first and fourth summations n ranges 
from 0 to oo, in the second n ranges from 1 to oo, and in the third the range is from 2 to 
oo. The common range is from 2 to oo. We now write out individually the terms in each 
summation that do not belong to this common range, and we continue to employ the 
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“sigma” notation to denote the remainder of each such summation. For example, in the 
first summation 


£ ( n + 2 H n + !) c » + 2*" 


n = 0 


of (6.20) we write out individually the terms corresponding to n = 0 and n = 1 and 
denote the remainder of this summation by 


We thus rewrite 


in (6.20) as 


Z (K + 2 )(« + 1 )c n+2 x". 


I (» + 2)(n + l)c n + 2 X ” 

/i = 0 


2c 2 + 6c 3 x + £ (n + 2)(« + l)c n+2 x". 

n = 2 


In like manner, we write 


in (6.20) as 


and 


£ nc„x" 
1 


c,x+ £ nc„x" 

n = 2 


2 £ c„x n 

M = 0 


in (6.20) as 


2 c 0 + 2c t x + 2 Z c n x "- 

n = 2 

Thus Equation (6.20) is now written as 

oo oo 

2c 2 + 6c 3 x + Z ( n + 2)(n + l)c„ + 2 x" + c t x + Z nc„x" 

n = 2 n= 2 

oo oo 

+ £ C„_ 2 x" + 2c 0 + 2c t x + 2 Z c n x " = 0. 

n = 2 n = 2 

We can now combine like powers of x and write this equation as 
(2 c 0 + 2c 2 ) + (3c ! + 6c 3 )x +£[(" + 2 )(” + l ) c »+ 2 + (« + 2)c„ + c„_ 2 ]x" = 0. 

n = 2 

(6.21) 

Equation (6.21) is in the desired form (6.9). For (6.21) to be valid for all x in the 
interval of convergence | x — x 0 1 < R, the coefficient of each power of x in the left 
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member of (6.21) must be equated to zero. This leads immediately to the conditions 

2c 0 + 2 c 2 = 0, (6.22) 

3c, + 6 c 3 = 0, (6.23) 

and 

(n + 2 )(n + l)c„ + 2 + (ft + 2 )c n + c„_ 2 = 0, n> 2. (6.24) 

The condition (6.22) enables us to express c 2 in terms of c 0 . Doing so, we find that 

c 2 =-c 0 . (6.25) 

The condition (6.23) enables us to express c 3 in terms of c i . This leads to 

c 3 = — 2 c i- (6.26) 

The condition (6.24) is called a recurrence formula. It enables us to express each 
coefficient c n + 2 for n > 2 in terms of the previous coefficients c n and c„_ 2 , thus giving 

_ (n + 2)c„ + c„_ 2 M ^ o ~n\ 

c n + : 

For n = 2, formula (6.27) is 


(n + 1 )(n + 2) ’ 




^ 4 = — 


4c 2 + c 0 
12 * 


Now using (6.25), this reduces to 

c 4 = ic 0 , (6.28) 

which expresses c 4 in terms of c 0 . For n = 3, formula (6.27) is 

5c 3 + c l 


Now using (6.26), this reduces to 

c 5 = 4o c i> (6.29) 

which expresses c 5 in terms of c t . In the same way we may express each even coefficient 
in terms of c 0 and each odd coefficient in terms of c t . 

Substituting the values of c 2 , c 3 , c 4 , and c 5 , given by (6.25), (6.26), (6.28), and (6.29), 
respectively, into the assumed solution (6.10), we have 

y = c 0 + c t x — c 0 x 2 — + ic 0 x 4 + ^c x x 5 + • • •. 

Collecting terms in c 0 and c i9 we have finally 

y = c 0 (l - x 2 + |x 4 + •••) + Ci(* ~ t* 3 + + *••)» ( 6 *30) 

which gives the solution of the differential equation (6.4) in powers of x through terms 
in x 5 . The two series in parentheses in (6.30) are the power series expansions of two 
linearly independent solutions of (6.4), and the constants c 0 and c t are arbitrary 
constants. Thus (6.30) represents the general solution of (6.4) in powers of x (through 
terms in x 5 ). 
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► Example 6.6 

Find a power series solution of the initial-value problem 

(x 2_ l) <pL + 3x d JL + xy = 0 , (6.31) 

dx ax 

3*0) = 4, (6.32) 

y'( 0) = 6. (6.33) 

Solution. We first observe that all points except x = ± 1 are ordinary points for the 
differential equation (6.31). Thus we could assume solutions of the form (6.6) for any 
x 0 # ±1. However, since the initial values of y and its first derivative are prescribed at 
x = 0, we shall choose x 0 = 0 and seek solutions in powers of x. Thus we assume 

y = z c « x ”- (6 34) 

n = 0 

Differentiating term by term we obtain 

dv ® 

j-= X " c « x " ( 6 - 35 ) 

dx „ = , 

and 

jT= Z »(»“ l )c n x"~ 2 . (6.36) 

dx „= 2 

Substituting the series (6.34), (6.35), and (6.36) into the differential equation (6.31), we 
obtain 


Z n{n - 1 )c n x n — £ ri(n — 1 )c n x n 2 + 3 Z nc n x n + Z c n x " + l = 0. (6.37) 

n- 2 n — 2 h — 1 n = 0 

We now rewrite the second and fourth summations in (6.37) so that x in each of these 
summations has the exponent n. Doing this, Equation (6.37) takes the form 

oo oo oo oo 


X n(n - 1 )c n x" - £ (n + 2 )(n + 1 )c n + 2 x" + 3 ^ nc„x" 

n = 2 n = 0 n = 1 


+ X C n-l X ” = 0. 

It = 1 


(6.38) 


The common range of the four summations in (6.38) is from 2 to oo. We can write out 
the individual terms in each summation that do not belong to this common range and 
thus express (6.38) in the form 


£ n(n - 1 )c n x n - 2c 2 - 6 c 3 x - £ (n + 2 )(n + 1 )c„ 


n = 2 


n = 2 


+ 2 ' 


+ 3 c t x + 3 Z nc n x n + c 0 x + Z c n-i x n = ^ 

n = 2 n = 2 


Combining like powers of x, this takes the form 
— 2 c 2 + (c 0 + 3c ! — 6c 3 )x 


+ £ [“(» + 2 )(« + !) c n +2 + «(» + 2 )c„ + c„_,]x" = 0. (6.39) 

n = 2 
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For (6.39) to be valid for all x in the interval of convergence \x — x 0 \ < R, the 
coefficient of each power of x in the left member of (6.39) must be equated to zero. In 
doing this, we are led to the relations 


and 


— 2c 2 = 0, 
c 0 + 3 c i — 6 c 3 = 0, 


(6.40) 

(6.41) 


— {n + 2 )(n + l)c n + 2 + n(n + 2 )c n + c n - i = 0, n > 2. (6.42) 


From (6.40), we find that c 2 = 0; and from (6.41), c 3 =£c 0 + 2 c i- The recurrence 
formula (6.42) gives 


n{n + 2 )c n 
(n + 1 )(n + 2) ’ 


Using this, we find successively 


n> 2. 


c 4 


C 5 


8 C 3 “h Cj 1 

12 = 12 Cl ’ 

15c 3 + c 2 1 3 

20 _ 8 Co + 8 Cl ' 


Substituting these values of c 2 , c 3 , c 4 , c 5 , . . . into the assumed solution (6.34), we have 


y = c 0 + c l x + 


£o , Ci\ 
6 2 ) 


x j +t^x* + Hr + 


12 


c 0 , 3c, 


8 8 


x 3 + • • • 


or 


y = c 0 (l + 5X 3 + 5X 5 + •••) + Cj(x + ^x 3 + tjx 4 + fx 5 + •••)• (6.43) 

The solution (6.43) is the general solution of the differential equation (6.31) in powers of 
x (through terms in x 5 ). 

We must now apply the initial conditions (6.32) and (6.33). Applying (6.32) to (6.43), 
we immediately find that 

c 0 = 4. 

Differentiating (6.43), we have 

~ = c 0 (jx 2 + fx 4 + • • •) + c,(l + fx 2 + 3X 3 + ^x 4 + •••)• (6.44) 


Applying (6.33) to (6.44) we find that 

c, = 6. 

Thus the solution of the given initial-value problem in powers of x (through terms in 
x 5 ) is 

y = 4(1 + £x 3 + £x 5 + •••) + 6(x + ^x 3 + t^x 4 + f x 5 + • • •) 
or 

y = 4 + 6x + -y-x 3 + jx 4 + Vx 5 + • • •• 
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Remark 7 . Suppose the initial values of y and its first derivative in conditions (6.32) 
and (6.33) of Example 6.6 are prescribed at x = 2, instead of x = 0. Then we have the 
initial-value problem 


& 

dx 2 


(* 2 “ l ) tt + 3x + x y = °> 


dy_ 

dx 


y( 2) = 4, /( 2) = 6. 


(6.45) 


Since the initial values in this problem are prescribed at x = 2, we would seek solutions 
in powers of x — 2. That is, in this case we would seek solutions of the form 


y = 


oo 


z 

■ = r 


c„(x - 2)". 


(6.46) 


The simplest procedure for obtaining a solution of the form (6.46) is first to make the 
substitution t = x — 2. This replaces the initial-value problem (6.45) by the equivalent 
problem 


(t 2 + 4t + 3) + (3t + 6) + (t + 2 )y = 0, 

y( 0) = 4, /( 0) = 6, 


(6.47) 


in which t is the independent variable and the initial values are prescribed at t = 0. One 
then seeks a solution of the problem (6.47) in powers of t , 


y= t *«*"• (6.48) 

M = 0 

Differentiating (6.48) and substituting into the differential equation in (6.47), one 
determines the c n as in Examples 6.5 and 6.6. The initial conditions in (6.47) are then 
applied. Replacing t by x — 2 in the resulting solution (6.48), one obtains the desired 
solution (6.46) of the original problem (6.45). 


Remark 2. In Examples 6.5 and 6.6 we obtained power series solutions of the 
differential equations under consideration but made no attempt to discuss the 
convergence of these solutions. According to Theorem 6.1, if x 0 is an ordinary point of 
the differential equation 

d 2 y dy 

«°M^y + a ^ X>j dx + Q2 ^ y = °’ ^ 

then the power series solutions of the form (6.2) converge in some interval | x — x 0 1 < R 
(where R > 0) about x 0 . Let us again write (6.1) in the normalized form 

^+P l (x)^+P 2 (x)y = 0, (6.3) 


where 


p .w = 


a 0 {x) 


and 


P 2 (x) = 


a 2 (x) 

a 0 (x)' 


If x 0 is an ordinary point of (6.1), the functions P, and P 2 have Taylor series expansions 
about x 0 that converge in intervals |x — x 0 | < R t and |x — x 0 | < R 2 , respectively, 
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about x 0 . It can be proved that the interval of convergence |x - x 0 | < R of a series 
solution (6.2) of (6.1) is at least as great as the smaller of the intervals |x — x 0 \ < R t and 
I x - x 0 | < K 2 - 

In the differential equation (6.4) of Example 6.5, P t (x) = x and P 2 (x) = x 2 + 2. Thus 
in this example the Taylor series expansions for P t and P 2 about the ordinary point 
x 0 = 0 converge for all x. Hence the series solutions (6.30) of (6.4) also converge for 
all x. 

In the differential equation (6.31) of Example 6.6, 

Pi(x) = p~T and = 

In this example the Taylor series for P l and P 2 about x 0 = 0 both converge for |x| < 1. 
Thus the solutions (6.43) of (6.31) converge at least for |x| < 1. 

Exercises 



Find the power series solution of each of the initial-value problems in Exercises 11-14. 

H. ^- x ^-y = 0 ’ m-u /(0) = 0. 

,2 - & + 4 - 2f=o - 

13. (x 2 + 1)0 + x£ + 2x^ = 0, y( 0) = 2, /(0) = 3. 

14. (2x 2 -3)0-2x£ + ); = O, y(0) = - 1, /(0) = 5. 



6.2 SOLUTIONS ABOUT SINGULAR POINTS; THE METHOD OF FROBENIUS 233 


Find power series solutions in powers of x — 1 of each of the differential equations in 
Exercises 15 and 16. 


A i2 y . ~ d y 


15 - * + x —. + y = °- 


dx 2 dx 


16. x 2 ^ + 3x^- y = 0. 


dx 2 dx 

17. Find the power series solution in powers of x - 1 of the initial-value problem 

x i^ + ^ +2y=o ’ ym = 2 - m=4 - 


18. The differential equation 


A 2 y 


(1 - x 2 ) -j-4- -2x-j L + n(n + l)y = 0, 
dx dx 


where n is a constant, is called Legendre's differential equation . 

(a) Show that x = 0 is an ordinary point of this differential equation, and find 
two linearly independent power series solutions in powers of x. 

(b) Show that if n is a nonnegative integer, then one of the two solutions found in 
part (a) is a polynomial of degree n. 


6.2 SOLUTIONS ABOUT SINGULAR POINTS; THE METHOD OF FROBENIUS 
A. Regular Singular Points 

We again consider the homogeneous linear differential equation 

a o(x)j^ + a l (x)j^ + a 2 {x)y = 0, (6.1) 

and we assume that x 0 is a singular point of (6.1). Then Theorem 6.1 does not apply at 
the point x 0 , and we are not assured of a power series solution 

y = Z c n(x - x 0 y (6.2) 

n = 0 

of (6.1) in powers of x — x 0 . Indeed an equation of the form (6.1) with a singular point 
at x 0 does not , in general, have a solution of the form (6.2). Clearly we must seek a 
different type of solution in such a case, but what type of solution can we expect? It 
happens that under certain conditions we are justified in assuming a solution of the 
form 


y = \x — x 0 | r Z c n( x - x o)\ (6-49) 

n = 0 

where r is a certain (real or complex) constant. Such a solution is clearly a power series 
in x — x 0 multiplied by a certain power of | x — x 0 1 . In order to state conditions under 
which a solution of this form is assured, we proceed to classify singular points. 
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We again write the differential equation (6.1) in the equivalent normalized form 

i2 y , o ,^ d y 


-j^ + P l (x)^ + P 2 (x)y = 0, 


dx 


dx 


(6.3) 


where 


and 


a 0 (x) 


a 0 (x) 


DEFINITION 

Consider the differential equation (6.1), and assume that at least one of the functions P 1 
and P 2 in the equivalent normalized equation (6.3) is not analytic at x 0 , so that x 0 is a 
singular point of (6.1). If the functions defined by the products 

(x — x 0 )P i (x) and (x — x 0 ) 2 P 2 (x) (6.50) 

are both analytic at x 0 , then x 0 is called a regular singular point of the differential 
equation (6.1). If either (or both) of the functions defined by the products (6.50) is not 
analytic at x 0 , then x 0 is called an irregular singular point of (6.1). 


► Example 6.7 

Consider the differential equation 

2x, 7^- x ^ + (x ~ 5), = a (65,) 

Writing this in the normalized form (6.3), we have 

d 2 y 1 dy x - 5 
dx 2 2x dx + 2x 2 ^ 

HereP^x) = - l/2xandP 2 (x) = (x - 5)/2x 2 . Since both P 1 and P 2 fail to be analytic at 
x = 0, we conclude that x = 0 is a singular point of (6.51). We now consider the 
functions defined by the products 

1 x — 5 

xPi(x ) = -- and x 2 P 2 (x) = — 

of the form (6.50). Both of these product functions are analytic at x = 0, and so x = 0 is 
a regular singular point of the differential equation (6.51). 


► Example 6.8 


Consider the differential equation 

x 2 (x — 2) 2 + 2(x - 2) + (x + l)y = 0. 


(6.52) 
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In the normalized form (6.3), this is 

d 2 y 2 dy x + 1 

dx 2 + x 2 (x — 2) dx + x 2 (x - 2) 2 ^ 

Here 


Pi(x) = 


2 

x 2 (x - 2) 


and 


P 2 (x) = 


x + 1 

x 2 (x - l) 2 ' 


Clearly the singular points of the differential equation (6.52) are x = 0 and x = 2. We 
investigate them one at a time. 

Consider x = 0 first, and form the functions defined by the products 


xP t (x) = 


2 

x(x - 2) 


and 


x 2 P 2 (x) = 


X 1 

(x - 2) 2 


of the form (6.50). The product function defined by x 2 P 2 {x) is analytic at x = 0, but that 
defined by xP l (x) is not. Thus x = 0 is an irregular singular point of (6.52). 

Now consider x = 2. Forming the products (6.50) for this point, we have 

(x - 2)P , (x) = T and (x - 2) 2 P 2 (x) = 


Both of the product functions thus defined are analytic at x = 2, and hence x = 2 is a 
regular singular point of (6.52). 


Now that we can distinguish between regular and irregular singular points, we shall 
state a basic theorem concerning solutions of the form (6.49) about regular singular 
points. We shall later give a more complete theorem on this topic. 


THEOREM 6.2 

Hypothesis. The point x 0 is a regular singular point of the differential equation {6.1). 

Conclusion. The differential equation {6.1) has at least one nontrivial solution of the 
form 

|x-Xol r Z ^(x-Xo)". (6-49) 

n = 0 

where r is a definite {real or complex) constant which may be determined , and this solution 
is valid in some deleted interval 0 < | x — x 0 1 < R {where R > 0) about x 0 . 


► Example 6.9 

In Example 6.7 we saw that x = 0 is a regular singular point of the differential equation 
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By Theorem 6.2 we conclude that this equation has at least one nontrivial solution of 
the form 


i*r Z c n 

n = 0 


valid in some deleted interval 0 < |x| < R about x = 0. 


► Example 6.10 

In Example 6.8 we saw that x = 2 is a regular singular point of the differential equation 

x 2 (x - 2) 2 ^4 + 2(x - 2) ^ + (x + 1 )y = 0. (6.52) 

ax ax 

Thus we know that this equation has at least one nontrivial solution of the form 

\x-2\ r t c„(x — 2)", 

n = 0 

valid in some deleted interval 0 < | x — 2 1 < jR about x = 2. 

We also observed that x = 0 is a singular point of Equation (6.52). However, this 
singular point is irregular and so Theorem 6.2 does not apply to it. We are not assured' 
that the differential equation (6.52) has a solution of the form 

1*1- Z C nX" 

n = 0 

in any deleted interval about x = 0. 


B. The Method of Frobenius 

Now that we are assured of at least one solution of the form (6.49) about a regular 
singular point x 0 of the differential equation (6.1), how do we proceed to determine the 
coefficients c n and the number r in this solution? The procedure is similar to that 
introduced in Section 6.1 and is commonly called the method of Frobenius . We shall 
briefly outline the method and then illustrate it by applying it to the differential 
equation (6.51). In this outline and the illustrative example that follows we shall seek 
solutions valid in some interval 0 < x — x 0 < R. Note that for all such x, |x — x 0 | is 
simply x - x 0 . To obtain solutions valid for - R < x — x 0 < 0, simply replace x — x 0 
by — (x - x 0 ) > 0 and proceed as in the outline. 


Outline of the Method of Frobenius 

1. Let x 0 be a regular singular point of the differential equation (6.1), seek solutions 
valid in some interval 0 < x — x 0 < R, and assume a solution 

oo 

y = {x- x 0 ) r £ c„(x - x 0 ) n 

n = 0 



6.2 SOLUTIONS ABOUT SINGULAR POINTS; THE METHOD OF FROBENIUS 237 


of the form (6.49), where c 0 # 0. We write this solution in the form 

y= I c n (x-x 0 y + \ (6.53) 

M — 0 

where c 0 # 0. 

2. Assuming term-by-term differentiation of (6.53) is valid, we obtain 

= Z (« + r ) c n(* - *o)' ,+r “ I (6-54) 

ax n = 0 


and 

tt= Z (« + »•)(« + r - l)c„(x - x 0 ) n+r " 2 . (6.55) 

ax n = 0 

We now substitute the series (6.53), (6.54), and (6.55) for y and its first two derivatives, 
respectively, into the differential equation (6.1). 

3. We now proceed (essentially as in Section 6.1) to simplify the resulting expres- 
sion so that it takes the form 

K 0 (x - x 0 ) r+k + K t (x - x 0 ) r+k+l + K 2 (x - x 0 ) r+fc + 2 + • • • = 0, (6.56) 

where k is a certain integer and the coefficients K^i = 0, 1,2,.. .) are functions of r and 
certain of the coefficients c n of the solution (6.53). 

4. In order that (6.56) be valid for all x in the deleted interval 0 < x - x 0 < JR, we 
must set 


K 0 = K l = K 2 =' = 0. 

5. Upon equating to zero the coefficient K 0 of the lowest power r + k of (x — x 0 ), we 
obtain a quadratic equation in r, called the indicial equation of the differential equation 
(6.1). The two roots of this quadratic equation are often called the exponents of the 
differential equation (6.1) and are the only possible values for the constant r in the 
assumed solution (6.53). Thus at this stage the “unknown” constant r is determined. We 
denote the roots of the indicial equation by r { and r 2 , where Re(r t ) > Re(r 2 ). Here 
Re(ry) denotes the real part of rj(j = 1,2); and of course if is real, then R .e(rj) is 
simply rj itself. 

6. We now equate to zero the remaining coefficients , K 2 , . . . in (6.56). We are thus 
led to a set of conditions, involving the constant r, which must be satisfied by the 
various coefficients c n in the series (6.53). 

7. We now substitute the root r t for r into the conditions obtained in Step 6, and then 
choose the c n to satisfy these conditions. If the c n are so chosen, the resulting series (6.53) 
with r = r Y is a solution of the desired form. Note that if r l and r 2 are real and unequal, 
then r x is the larger root. 

8. If r 2 # r l9 we may repeat the procedure of Step 7 using the root r 2 instead of r t . 
In this way a second solution of the desired form (6.53) may be obtained. Note that iff 
r i and r 2 are real and unequal, then r 2 is the smaller root. However, in the case in which 
r l and r 2 are real and unequal, the second solution of the desired form (6.53) obtained in 
this step may not be linearly independent of the solution obtained in Step 7. Also, in the 
case in which r t and r 2 are real and equal, the solution obtained in this step is clearly 
identical with the one obtained in Step 7. We shall consider these “exceptional” 
situations after we have considered an example. 
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► Example 6.11 

Use the method of Frobenius to find solutions of the differential equation 

2x 2 ^-x^- + (x-5)y = 0 (6.51) 

dx z ax 

in some interval 0 < x < R. 


Solution. Since x = 0 is a regular singular point of the differential equation (6.51) 
and we seek solution for 0 < x < R, we assume 


where c 0 # 0. Then 


and 


y = Yj C„X" + r , 

n = 0 


J- = Z (« + r)c„x n+r 1 
ax n = 0 


pj= t (n + r)(n + r- l)c n x" +l - 2 . 
ax n =0 

Substituting the series (6.57), (6.58), and (6.59) into (6.51), we have 


(6.57) 


(6.58) 


(6.59) 


2 Y {n + r)(n + r — l)c n x n+r — Y ( n + r)c„x n+r + Y c n x" +r+l — 5 Y c n x" +r = 0. 

n = 0 n = 0 m — 0 m — 0 

Simplifying, as in the examples of Section 6.1, we may write this as 


£ [2(n + r)(n + r - 1) — (n + r) — 5]c„x" +r + Y c n-iX” +r = 0 

n = 0 n = 1 


or 


[2r(r - 1) - r - 5]c 0 x r 

+ I {[2 (n + r)(n + r - 1) - (n + r) - 5]c„ + c n . t }x n+r = 0. (6.60) 


This is of the form (6.56), where k = 0. 

Equating to zero the coefficient of the lowest power of x (that is, the coefficient of x r ) 
in (6.60), we are led to the quadratic equation 


2 r(r — 1) — r — 5 = 0 


(since we have assumed that c 0 # 0). This is the indicial equation of the differential 
equation (6.51). We write it in the form 

2r 2 — 3r — 5 = 0 

and observe that its roots are 

r i = f and r 2 = - 1. 
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These are the so-called exponents of the differential equation (6.51) and are the only 
possible values for the previously unknown constant r in the solution (6.57). Note that 
they are real and unequal. 

Equating to zero the coefficients of the higher powers of x in (6.60), we obtain the 
recurrence formula 

[2 (n + r)(n + r - 1) - (n + r) - 5 ]c„ + c„_ t = 0, n> 1. (6.61) 

Letting r = r t = f in (6.61), we obtain the recurrence formula 

[2 (» + f)(n + !)-(« + f) -5]c„ + c„_, = 0, n> l, 
corresponding to the larger root f of the indicial equation. This simplifies to the form 


or, finally, 


n(2n + l)c n + c n . t = 0, n > 1, 


c 


n 


Using (6.62) we find that 


c 


n-l 


n(2n + iy 


n > 1. 


Cl = 


Co 

9’ 


c l _ C Q 

22 ~ 198’ 



_Co_ 

7722’ 


(6.62) 


Letting r = f in (6.57), and using these values of c t , c 2 , c 3 , . . . , we obtain the solution 
y = c 0 (x 512 - ^ + jhx 912 - 7 fex 11/2 + • • •) 

= c 0 x 5/2 (l - ijX + -iigX 2 - rfnx 3 + •••), ( 6 - 63 ) 

corresponding to the larger root r i = f . 

We now let r = r 2 = — 1 in (6.61) to obtain the recurrence formula 

[2 (n - 1 )(n -2) -(n-l)- 5 ]c„ + c JJ . 1 =0, n > 1, 

corresponding to this smaller root of the indicial equation. This simplifies to the form 

n(2n - l)c n + c„_! = 0, n > 1, 

or finally 


Using this, we find that 

c \ = 5 C 0? 


c „- 1 

n(2n - 7)’ 


n > 1. 


c 2 — 6 C 1 — 30 c 0> 




3^2 


90 C 0 , 


Letting r = - 1 in (6.57), and using these values of c l5 c 2 , c 3 ,... 

y = c 0 (x " 1 + | + JoX + £>x 2 -■■ ■) 

= CqX~ Hi + ix + ioX 2 + ^X 3 - • • •)> 


we obtain the solution 
(6.64) 


corresponding to the smaller exponent r 2 — — 1. 

The two solutions (6.63) and (6.64) corresponding to the two roots f and — 1, 
respectively, are linearly independent. Thus the general solution of (6.51) may be 
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written 

y = Qx 5 ' 2 ^ - £x + J^gX 2 - rmx 3 + •••)+ C 2 X~ 1 (l + jx + ^jx 2 + <&x 3 ), 

where C t and C 2 are arbitrary constants. 

Observe that in Example 6.11 two linearly independent solutions of the form (6.49) 
were obtained for x > 0. However, in Step 8 of the outline preceding Example 6.11, 
we indicated that this is not always the case. Thus we are led to ask the following 
questions: 

1. Under what conditions are we assured that the differential equation (6.1) has two 
linearly independent solutions 

oo 

l*-Xo| r Z C n(x-X o)" 

n = 0 

of the form (6.49) about a regular singular point x 0 ? 

2. If the differential equation (6.1) does not have two linearly independent solu- 
tions of the form (6.49) about a regular singular point x 0 , then what is the form of a 
solution that is linearly independent of the basic solution of the form (6.49)? 

In answer to these questions we state the following theorem. 

THEOREM 6.3 

Hypothesis. Let the point x 0 be a regular singular point of the differential equation 
(6.1). Let r v and r 2 [ where Re(r x ) > Re(r 2 )] be the roots of the indicial equation 
associated with x 0 . 

Conclusion 1. Suppose r t — r 2 / N, where N is a nonnegative integer (that is , 
r x — r 1 ¥ 1 0, 1,2,3,...). Then the differential equation (6.1) has two nontrivial linearly 
independent solutions y t and y 2 of the form (6.49) given respectively by 

yi M = \x- x 0 p Z c »( x ~ x o )"> (6-65) 

11 = 0 

where c 0 # 0, and 

y 2 (x) = |x - x 0 p Z c * ( x ~ *o) n > (6-66) 

n = 0 

where c% # 0. 

Conclusion 2. Suppose r t — r 2 = N, where N is a positive integer. Then the 
differential equation (6.1) has two nontrivial linearly independent solutions y i and y 2 given 
respectively by 

3 'i(x) = |x-x o r Z c„(x - x 0 ) n , (6.65) 

n = 0 

where c 0 # 0, and 

y 2 (x) = |x - x 0 \' 2 Z c*(x - x oY + Qi Win |x - x 0 |, 

n = 0 

where eg # 0 and C is a constant which may or may not be zero. 


(6.67) 
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Conclusion 3. Suppose r t — r 2 = 0. Then the differential equation (6.1) has two 
nontrivial linearly independent solutions y 1 and y 2 given respectively by 

y 1 (x) = |x-XoP X c„(x-x 0 )", (6.65) 

n = 0 

where c 0 # 0, and 

hW = \x - x 0 r , + 1 X c ?(x — Xo)" + ,yi(x)ln |x — x 0 |. (6.68) 

M — 0 

The solutions in Conclusions 7, 2, and 3 are valid in some deleted interval 0 < 

| x — x 0 1 < JR about x 0 . 

In the illustrative examples and exercises that follow, we shall again seek solutions 
valid in some interval 0 < x — x 0 < R. We shall therefore discuss the conclusions 
of Theorem 6.3 for such an interval. Before doing so, we again note that if 0 < 
x — x 0 < JR, then | x — x 0 1 is simply x — x 0 . 

From the three conclusions of Theorem 6.3 we see that if x 0 is a regular singular 
point of (6.1), and 0 < x — x 0 < JR, then there is always a solution 

00 

y t (x) = (x - x 0 r x c »(* - x o) n 

n = 0 

of the form (6.49) for 0 < x - x 0 < JR corresponding to the root r l of the indicial 
equation associated with x 0 . Note again that the root r t is the larger root if r x and r 2 are 
real and unequal. From Conclusion 1 we see that if 0 < x — x 0 < JR and the difference 
r i — r 2 between the roots of the indicial equation is not zero or a positive integer, 
then there is always a linearly independent solution 

oo 

_y 2 (x) = (x - x 0 p X c * ( x ~ x o )" 

n = 0 

of the form (6.49) for 0 < x - x 0 < JR corresponding to the root r 2 . Note that the root 
r 2 is the smaller root if r Y and r 2 are real and unequal. In particular, observe that if r i 
and r 2 are conjugate complex, then r t - r 2 is pure imaginary, and there will always be a 
linearly independent solution of the form (6.49) corresponding to r 2 . However, from 
Conclusion 2 we see that if 0 < x — x 0 < R and the difference r l — r 2 is a positive 
integer , then a solution that is linearly independent of the “basic” solution of the form 
(6.49) for 0 < x — x 0 < R is of the generally more complicated form 

00 

}’ 2 (x) = (X - X 0 P X c *(x - x 0 ) n + Cy!(x)ln |x - x 0 | 

n = 0 

for 0 < x - x 0 < R. Of course, if the constant C in this solution is zero, then it reduces 
to the simpler type of the form (6.49) for 0 < x - x 0 < R. Finally, from Conclusion 3, 
we see that if r t — r 2 is zero , then the linearly independent solution y 2 W ^/ways 
involves the logarithmic term y L (x)ln | x — x 0 1 and is never of the simple form (6.49) for 
0 < x — x 0 < JR. 

We shall now consider several examples that will (1) give further practice in the 
method of Frobenius, (2) illustrate the conclusions of Theorem 6.3, and (3) indicate 
how a linearly independent solution of the more complicated form involving the 
logarithmic term may be found in cases in which it exists. In each example we shall take 
x 0 = 0 and seek solutions valid in some interval 0 < x < R. Thus note that in each 
example |x - x 0 | = |x| = x. 
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► Example 6.12 

Use the method of Frobenius to find solutions of the differential equation 

2 x ‘j^ + x j; + ^-Vy- 0 < 669 > 

in some interval 0 < x < R. 

Solution. We observe at once that x = 0 is a regular singular point of the 
differential equation (6.69). Hence, since we seek solutions for 0 < x < R, we assume a 
solution 


y = c n x n + r , 


(6.70) 


where c 0 # 0. Differentiating (6.70), we obtain 

dv 00 d 2 v 00 

— =£(« + r)c„x" +r ~ 1 and = £ (n + r)(n + r - 1 )c„x" +r ~ 2 . 
ttX n = 0 ax 11 = 0 

Upon substituting (6.70) and these derivatives into (6.69), we find 
2 £ (n + r)(n + r — l)c„x" +r + £ (n + r)c n x n+r + £ c n x n+r+2 - 3 £ c„x” +r = 0. 

n— 0 n = 0 n = 0 n = 0 

Simplifying, as in the previous examples, we write this as 


Z C 2 (" + '■)(« + r - 1) + (n + r) - 3]c„x n+r + £ c n _ 2 x n+r = 0 

m = 0 « = 2 


or 


[2 r(r - 1) + r - 3]c 0 x r + [2(r + l)r + (r + 1) - 3]c 1 x r+1 

+ £ {[2(» + + r - 1) + (n + r) - 3 ]c„ + c n _ 2 }x n+ '' = 0. (6.71) 

M = 2 

This is of the form (6.56), where k = 0. 

Equating to zero the coefficient of the lowest power of x in (6.71), we obtain the 
indicial equation 

2r(r — l)-hr — 3 = 0 or 2r 2 - r — 3 = 0. 

The roots of this equation are 

r l = | and r 2 = —1. 

Since the difference r l — r 2 = f between these roots is not zero or a positive integer, 
Conclusion 1 of Theorem 6.3 tells us that Equation (6.69) has two linearly independent 
solutions of the form (6.70), one corresponding to each of the roots r i and r 2 . 

Equating to zero the coefficients of the higher powers of x in (6.71), we obtain the 
condition 


[2(r + l)r + (r + 1) - 3]^ = 0 (6.72) 

and the recurrence formula 

[2 (n + r)(n + r - 1) + (n + r) - 3 ]c„ + c„_ 2 = 0, n > 2. (6.73) 
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Letting r = r t = § in (6.72), we obtain lc l = 0 and hence c t = 0. Letting r = r l = f in 
(6.73), we obtain (after slight simplifications) the recurrence formula 

n{2n + 5 )c n + c„_ 2 = 0, n > 2, 

corresponding to the larger root §. Writing this in the form 


we obtain 


Cf, -2 

n(2n + 5)’ 


n > 2, 


c 3 =-|i = 0 (since Cl =0), c 4 « -g = ^, . . . . 

Note that all odd coefficients are zero, since c i = 0. Letting r = f in (6.70) and using 
these values of c l5 c 2 , c 3 ,..., we obtain the solution corresponding to the larger root 
r i = This solution is y = ^x(x), where 

y,(x) = C 0 x 3/2 (1 - + 936 -* 4 -•••)• (6-74) 

Now let r = r 2 = — 1 in (6.72). We obtain -3c t = 0 and hence = 0. Letting 

r = r 2 = — 1 in (6.73), we obtain the recurrence formula 

n(2n - 5 )c n + c„_ 2 = 0, n > 2, 

corresponding to the smaller root — 1. Writing this in the form 


we obtain 


C n- 2 

n(2n - 5)’ 


n > 2, 


c 2 



C 3 


3 


= 0 (since c x = 0), 


C 2 _ C Q 

12“ 24’"" 


In this case also all odd coefficients are zero. Letting r = — 1 in (6.70) and using these 
values of c u c 2 , c 3 ,..., we obtain the solution corresponding to the smaller root r 2 = 
— 1. This solution is y = .y 2 (x), where 

y 2 (x) = c 0 x -1 (l + ix 2 - ^x 4 + • ••)• (6.75) 

Since the solutions defined by (6.74) and (6.75) are linearly independent, the general 
solution of (6.69) may be written 

y = c i)T(x) + C 2 y 2 (x), 

where C t and C 2 are arbitrary constants and y^x) and y 2 (x) are defined by (6.74) and 
(6.75), respectively. 


► Example 6.13 


Use the method of Frobenius to find solutions of the differential equation 


2 d2 y .. d y 


dx 2 


x^-(x 2 +i)y = 0 


(6.76) 


in some interval 0 < x < R. 
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Solution. We observe that x = 0 is a regular singular point of this differential 
equation and we seek solutions for 0 < x < R. Hence we assume a solution 


y=l c »*" +r ’ 


n = 0 


(6.77) 


where c 0 # 0. Upon differentiating (6.77) twice and substituting into (6.76), we obtain 


£ ( n + r X n + r - l)c„*" +r - £ (« + r)c„x" +r - £ c n x n + r + 2 - - £ c„x" + r = 0. 

n = 0 n = 0 n = 0 4 n = 0 

Simplifying, we write this in the form 


£ [(« + r)(n + r - 1) - (n + r) - |]c n x" +r - £ c n . 2 x n+ ' = 0 

n=0 n—2 


or 

[r(r - 1) - r - f]c 0 x r + [(r + l)r — (r + 1) — |]c 1 x , ' +1 

00 

+ £ {[(« + r)(n + r - 1) - (n + r) - |]c„ - c n _ 2 }x" +r = 0. (6.78) 

fi = 2 

Equating to zero the coefficient of the lowest power of x in (6.78), we obtain the 
indicial equation 

r 2 — 2r - | = 0. 

The roots of this equation are 

r - 1 r _ _1 
r l — 2? r 2 — 2* 

Although these roots themselves are not integers, the difference r l — r 2 between them is 
the positive integer 3. By Conclusion 2 of Theorem 6.3 we know that the differential 
equation (6.76) has a solution of the assumed form (6.77) corresponding to the larger 
root r l = f . We proceed to obtain this solution. 

Equating to zero the coefficients of the higher powers of x in (6.78), we obtain the 
condition 


IV + l)r — (r + 1) — l]c! = 0 (6.79) 

and the recurrence formula 

[(n + r){n + r - 1) - (n + r) - f]c„ - c„_ 2 = 0, n > 2. (6.80) 

Letting r = r Y = § in (6.79), we obtain 


4c i = 0 and hence = 0. 

Letting r = r t = f in (6.80), we obtain the recurrence formula 
n(n + 3)c„ - c n -2 =0, n > 2, 

corresponding to the larger root Since n > 2, we may write this in the form 

c n- 2 


C n = 


n(n + 3) ’ 


n> 2. 



6.2 SOLUTIONS ABOUT SINGULAR POINTS; THE METHOD OF FROBENIUS 245 


From this we obtain successively 


= 


c 3 = JY6 = 0 (since c t = 0), 


c, = 


4-7 2 • 4 • 5 • 7’ " ” 

We note that all odd coefficients are zero. The general even coefficient may be written 


2 " [2*4*6*** (2n)][5 • 7 • 9 • • • (2n + 3)]’ “ ' 

Letting r = f in (6.77) and using these values of c 2 „, we obtain the solution corre- 
sponding to the larger root r t = §. This solution is y = ^(x), where 


JlW = c 0 x 512 


= C 0 X 512 


1 + — — — + — 
2-5 2 


x 4 "I 

.' 4 : 5 . 7 + " ’ 


1+ Z 


r 2n 


n=i [2*4*6** (2n)] [5 • 7 * 9 * * • (2/t + 3)] 


(6.81) 


We now consider the smaller root r 2 = — j. Theorem 6.3 does not assure us that the 
differential equation (6.76) has a linearly independent solution of the assumed form 
(6.77) corresponding to this smaller root. Conclusion 2 of that theorem merely tells us 
that there is a linearly independent solution of the form 


£ c*x n + r2 + Cy^xjln x, (6.82) 

n = 0 

where C may or may not be zero. Of course, if C = 0, then the linearly independent 
solution (6.82) is of the assumed form (6.77) and we can let r = r 2 = — \ in the formula 
(6.79) and the recurrence formula (6.80) and proceed as in the previous examples. Let us 
assume (hopefully, but without justification!) that this is indeed the case. 

Thus we let r = r 2 = —jin (6.79) and (6.80). Letting r = —j in (6.79) we obtain 
— 2c ! =0 and hence c t = 0. Letting r— —j in (6.80) we obtain the recurrence formula 

n(n — 3 )c n — c„_ 2 = 0, n> 2, (6.83) 

corresponding to the smaller root — j. For n # 3, this may be written 


( 6 - 84 ) 

n(n — 3) 

For n = 2, formula (6.84) gives c 2 = — c 0 /2. For n = 3, formula (6.84) does not apply 
and we must use (6.83). For n = 3 formula (6.83) is 0 • c 3 — c t = 0 or simply 0 = 0 
(since c x = 0). Hence, for n = 3, the recurrence formula (6.83) is automatically satisfied 
with any choice of c 3 . Thus c 3 is independent of the arbitrary constant c 0 ; it is a second 
arbitrary constant! For n > 3, we can again use (6.84). Proceeding, we have 



C 5 


_[ 3 _ 

2-5’ 


C 4 C 0 _ C 5 _ C 3 

6*3 _ 2-4-6*3’ C? 4 • 7 _ 2 • 4 • 5 • 7’ 


We note that all even coefficients may be expressed in terms of c 0 and that all odd 
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coefficients beyond c 3 may be expressed in terms of c 3 . In fact, we may write 

Co 


C2n [2 • 4 • 6 • • • (2n)][3 • 5 • 7 • • • (2n - 3)]’ 
(even coefficients c 6 and beyond), and 

c 3 


C 2n+l ~ 


[2 • 4 • 6 • • • (2n - 2)][5 • 7 • 9 • • • (2n + 1)] ’ 


n > 3 


n > 2 


(odd coefficients c 5 and beyond). Letting r = — j in (6.77) and using the values of c n in 
terms of c 0 (for even n) and c 3 (for odd n beyond c 3 ), we obtain the solution 
corresponding to the smaller root r 2 = This solution is y = y 2 ( x )> where 


>> 2 (x) = c 0 x 1/2 


['-T 


2-4 2 • 4 • 6 • 3 


+ c,x 


■ 1/2 


X 3 + 


+ 


= C 0 X 1/2 


2-5 2 • 4 • 5 • 7 

y* 4 oo 

0 . A 


+ 




+ C 3 X 


- 1/2 


2 2-4 w = 3 [2 ■ 4 * 6 * * * (2n)][3 • 5 • 7 • • • (2n - 3)] 

00 v 2«+l 


+ I 


« = 2 [2 ■ 4 ’ 6 ■ ■ ■ (2ft 2)] [5 * 7 ■ 9 ■ ■ * (2/r -h 1)] 


(6.85) 


and c 0 and c 3 are arbitrary constants. 

If we now let c 0 = 1 in (6.81) we obtain the particular solution y = yn(x), where 


yMx) = x 


- v 5/2 


1 + I 


, 2 « 


n =. i [2 • 4 • 6 • • • (2n)] [5 • 7 • 9 • • • (2« + 3)] _ 

corresponding to the larger root § ; and if we let c 0 = 1 and c 3 = 0 in (6.85) we obtain 
the particular solution y = y 2 i(x), where 


y 2i(x) = x 


_ Y " 1/2 


v 2 y 4 oo v 2n 

< _ ^ A _ y 

2 2-4 „ = 3 [2 • 4 • 6 • • • (2n)][3 • 5 • 7 • • • (2n - 3)] 


corresponding to the smaller root These two particular solutions, which are both 
of the assumed form (6.77), are linearly independent. Thus the general solution of the 
differential equation (6.76) may be written 


y = C 1 y 11 (x) + C 2 y 21 (x), 


(6.86) 


where C i and C 2 are arbitrary constants. 

Now let us examine more carefully the solution y 2 defined by (6.85). The expression 

oo v.2«+l 


- 1/2 


+ z 


„e 2 [2 • 4 • 6 • • • (2n - 2)][5 • 7 • 9. • • • (2n + 1)] 
of which c 3 is the coefficient in (6.85) may be written 


x 5/2 1 + Z 


,2n-2 


2 [2 ■ 4 ■ 6 ■ ■ ■ (2n — 2)][5 • 7 • 9 • • -(2n + 1)] 


1+ z 


/• 2 n 


„= i [2 • 4 • 6 • • • (2n)] [5 • 7 • 9 • • • (2n + 3)] 
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and this is precisely y n (x). Thus we may write 

y 2 W = Coy2iW + c 3 yuW» (6-87) 

where c 0 and c 3 are arbitrary constants. Now compare (6.86) and (6.87). We see that the 
solution y = y 2 (x) by itself is actually the general solution of the differential equation 
(6.76), even though y 2 (x) was obtained using only the smaller root — j. 

We thus observe that if the difference r l — r 2 between the roots of the indicial 
equation is a positive integer, it is sometimes possible to obtain the general solution 
using the smaller root alone, without bothering to find explicitly the solution corre- 
sponding to the larger root. Indeed, if the difference r i — r 2 is a positive integer, it is a 
worthwhile practice to work with the smaller root first, in the hope that this smaller 
root by itself may lead directly to the general solution. 


► Example 6.14 

Use the method of Frobenius to find solutions of the differential equation 

_ d 2 y ^ dy 

x 2 + (x 2 — 3x) -y- + 3y = 0 (6.88) 

ax ax 

in some interval 0 < x < R. 

Solution. We observe that x = 0 is a regular singular point of (6.88) and we seek 
solutions for 0 < x < R. Hence we assume a solution 

y=£c„x" + ', (6.89) 

n = 0 

where c 0 # 0. Upon differentiating (6.89) twice and substituting into (6.88), we obtain 
£ (n + r)(n + r - l)c„x" +r + £ (n + r)c„x" +r+l 

n= 0 n-0 

-3 £ (n + r)c„x n+r + 3 £ c„x n+r = 0. 

n = 0 n = 0 

Simplifying, we write this in the form 

X [(« + r)(n + r — 1) — 3(n + r) + 3]c„x n+r + X (” + r ~ l) c n-i x " +r = 0 

n—0 n = 1 


or 


[r(r - 1) - 3r + 3]c 0 x' + X {[(” + ')(" + r - 1) - 3(n + r) + 3 ]c„ 

n = 1 

+ (n + r- l)c„_ 1 }x' , + r = 0. (6.90) 

Equating to zero the coefficient of the lowest power of x in (6.90) we obtain the 
indicial equation 

r 2 — 4r + 3 = 0. 

The roots of this equation are 

r 2 = 1. 


r i = 3, 
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The difference r i — r 2 between these roots is the positive integer 2. We know from 
Theorem 6.3 that the differential equation (6.88) has a solution of the assumed form 
(6.89) corresponding to the larger root = 3. We shall find this first, even though the 
results of Example 6.13 suggest that we should work first with the smaller root r 2 = 1 in 
the hopes of finding the general solution directly from this smaller root. 

Equating to zero the coefficients of the higher powers of x in (6.90), we obtain the 
recurrence formula 

l(n + r)(n + r — 1) — 3(n + r) + 3]c„ + (n + r — \)c n - l =0. n> 1. (6.91) 
Letting r = r i = 3 in (6.91), we obtain the recurrence formula 
n(n + 2)c„ + (n + 2)c„ _ i = 0, n > 1 , 

corresponding to the larger root 3. Since n > 1, we may write this in the form 


From this we find successively 

Ci c 0 c 2 c 0 (-l)"c 0 

c i - c o> c 2- 2 ”2!’ ~ 3 — 3! ’ * ’ ‘ n \ 


Letting r = 3 in (6.89) and using these values of c„, we obtain the solution corre- 
sponding to the larger root r l = 3. This solution is y =yi(x), where 

3 f x 2 x 3 (— l)"x" 1 

hW = c o* \ 1_x + Y _ 3T + ' + — n \ — + ■ ‘ ' • 

We recognize the series in brackets in this solution as the Maclaurin expansion for e~ x . 
Thus we may write 

y l {x) = c 0 x 3 e~ x (6.92) 

and express the solution corresponding to r Y in the closed form 

y = c 0 x 3 e~ x , 

where c 0 is an arbitrary constant. 

We now consider the smaller root r 2 = 1. As in Example 6.13, we have no assurance 
that the differential equation has a linearly independent solution of the assumed form 
(6.89) corresponding to this smaller root. However, as in that example, we shall 
tentatively assume that such a solution actually does exist and let r = r 2 = 1 in (6.91) in 
the hopes of finding “it.” Further, we are now aware that this step by itself might even 
provide us with the general solution. 

Thus we let r = r 2 = 1 in (6.91) to obtain the recurrence formula 

n(n — 2 )c n + nc n - t = 0, n > 1, (6.93) 

corresponding to the smaller root 1. For n # 2, this may be written 

c„ = n>\, 2. (6.94) 


For n = 1, formula (6.94) gives c x = c 0 .Forn = 2, formula (6.94) does not apply and we 
must use (6.93). For n = 2 formula (6.93) is 0 • c 2 + 2c t = 0, and hence we must have 
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= 0. But then, since c t = c 0 , we must have c 0 = 0. However, c 0 # 0 in the assumed 
solution (6.89). This contradiction shows that there is no solution of the form (6.89), 
with c 0 # 0, corresponding to the smaller root 1. 

Further, we observe that the use of (6.94) for n > 3 will only lead us to the solution y t 
already obtained. For, from the condition 0 • c 2 + 2c t = 0 we see that c 2 is arbitrary; 
and using (6.94) for n > 3, we obtain successively 


c 3 — — c 2 , 


<4 = 



^2 
2 ! ’ 


C 5 


£4 

3 


<2 _(— 1)"C 2 

1 ? • • • » Cn + 2 ■ 


,n > 1. 


Thus letting r = 1 in (6.89) and using these values of c„, we obtain formally 

x 4 x 5 


y = c 2 x 


= C 2 X 3 


c - x + ¥"¥ + 
X 2 X 3 

X + ¥“ 3!~ + ' 


(-l)"x n 
' + n\ + 

n\ 


= c 2 x 3 e x . 


Comparing this with (6.92), we see that it is essentially the solution y = yi(x). 

We now seek a solution of (6.88) that is linearly independent of the solution y t . From 
Theorem 6.3 we now know that this solution is of the form 


X c*x" +1 + Cy 1 (x)lnx, (6.95) 

w = 0 

where # 0 and C # 0. Various methods for obtaining such a solution are available; 
we shall employ the method of reduction of order (Section 4.1). We let y = f(x)v , where 
/(x) is a known solution of (6.88). Choosing for / the known solution y t defined by 
(6.92), with c 0 = 1, we thus let 

y — x 3 €~ x v. (6.96) 

From this we obtain 

^ * + ( 3 x 2 e -x _ x 3 e -X) v ( 6 . 97 ) 

ax ax 

and 

= x 3 €~ x — 7 — y + 2(3x 2 c _x — x 3 e~ x )^~ + (x 3 c _x — 6x 2 ^ _x + 6xc“ x )i;. (6.98) 
ax ax ax 


Substituting (6.96), (6.97), and (6.98) for y and its first two derivatives, respectively, in 
the differential equation (6.88), after some simplifications we obtain 


d^V uv 


dv 

dx 


(6.99) 


Letting w = dv/dx , this reduces at once to the first-order differential equation 
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a particular solution of which is w = x 3 e x . Thus a particular solution of (6.99) is given 
by 

v = j* x~ 3 e x dx , 

and hence y = y 2 (x), where 

y 2 (x) = x 3 e~ x J x~ 3 e x dx (6.100) 

is a particular solution of (6.88) that is linearly independent of the solution y t defined 
by (6.92). 

We now show that the solution y 2 defined by (6. 100) is of the form (6.95). Introducing 
the Maclaurin series for e x in (6.100) we have 


y 2 (x) = x 3 e x 
= x 3 e~ x 


_ 3 / X 2 x 3 x x \ 

x 1 (>+ ) ‘ + y + l- + 24 + ) 4x 
(x- s + x - 2 + ix-' + i + ^ + - ■■'jdx. 


Integrating term by term, we obtain 


y 2 (x) = X V 


111 , 1 1 2 

-t— 2 fylnx + yx+y^x 

2x l x 2 6 48 


Now introducing the Maclaurin series for e x ,/we may write 


yi(x) 


-lx 3 


x* + ■ 


X” 

~ 6 ~ 


+ ••• - 


1 

2 ? 


11 1 2 

•- + 7 X +TZ X 
x 6 48 



+ jx 3 e~ x In x. 

Finally, multiplying the two series involved, we have 

y 2 (x) = {-\x - \x 2 + |x 3 - |x 4 + • * *) + \x 3 e~ x In x, 

which is of the form (6.95), where ^(x) = x 3 e“ x . The general solution of the differential 
equation (6.88) may thus be written 


y = + C 2 y 2 (x), 

where C Y and C 2 are arbitrary constants. 

In this example it was fortunate that we were able to express the first solution y t in 
closed form, for this simplified the computations involved in finding the second 
solution y 2 . Of course the method of reduction of order may be applied to find the 
second solution, even if we cannot express the first solution in closed form. In such 
cases the various steps of the method must be carried out in terms of the series 
expression for y t . The computations that result are generally quite complicated. 


Examples 6.12, 6.13, and 6.14 illustrate all of the possibilities listed in the conclusions 
of Theorem 6.3 except the case in which r t — r 2 = 0 (that is, the case in which the roots 
of the indicial equation are equal). In this case it is obvious that for 0 < x — x 0 < JR 
both roots lead to the same solution 


oo 


y 1 =(x-x 0 ) r X c„(x-x 0 ) n , 

n = 0 
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where r is the common value of r i and r 2 . Thus, as Conclusion 3 of Theorem 6.3 states, 
for 0 < x — x 0 < R, a linearly independent solution is of the form 

oo 

y 2 =(x- X 0 ) r+1 £ C*(x - X 0 y + J'l (x)ln(x - X 0 ). 

n = 0 

Once y l (x) has been found, we may obtain y 2 by the method of reduction of order. This 
procedure has already been illustrated in finding the second solution of the equation in 
Example 6. 1 4. A further illustration is provided in Section 6.3 by the solution of Bessel’s 
equation of order zero. 


Exercises 


Locate and classify the singular points of each of the differential equations in Exercises 
1-4. 


1. (x 2 — 3x) + (x + 2) ^ + y = 0. 

2. (x 3 + x 2 ) + (x 2 — 2x) y^- -b 4 y = 0. 

dx z ax 


3. (x 4 — 2x 3 + x 2 ) + 2(x — 1) + x 2 y = 0. 

ax ax 

4. (x 5 + x 4 — 6x 3 ) ^-y + x 2 ^ + (x — 2)y = 0. 

ax ax 


Use the method of Frobenius to find solutions near x = 0 of each of the differential 
equations in Exercises 5-26. 

d 2 y dy 
dx 2 + dx 


5. 2x 2 y-y + x ^ -b (x 2 — l)y = 0. 


6. 2x 2 + x + (2x 2 — 3)y = 0. 


7. x' 


8. x 


9. x 


dx 2 

d 2 y 

dx 2 

d 2 y 

dx 2 

d 2 y 

dx 2 


+ x 2 +- y = 0. 


+ x 


dx 

dy_ 

X dx 

4 x+ [2^ + i)y-0. 

dy 


dx 


+ * -sb = °- 


10. 2x ^-4 + + 2y = 0. 

dx z dx 


11. 3x 


d 2 y 


dy 


dx 




12. x 


d 2 y , 


+ 2 + x_y = 0. 


dx 2 ' ~ dx 
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13. 

14. 

15. 

16. 

17. 

18. 

19. 

20 . 

21 . 

22 . 

23. 

24. 

25. 

26. 


2 d^ 

dx 2 


dy_ 

dx 


y = o. 


.g + ( ^ + 4_,_a 


d 2 y 


dy 


+2 ^ +x, -°- 


2 djy 

dx 2 


2 di 

dx 


x* + x“ -r- ~ 2y = 0. 


(2x 2 - x )^T + ( 2x - 2 )^ + ( ~ 2x 2 + 3x - 2 )y = 0. 


X" 


d 2 y dy 3 


dx 2 


*Tx + S'* 0 ' 


d 2 y dy 
dx 2 + X dx 


x~ — j + x -f - + (x — l)y = 0. 
d 2 y 


xl 4? + ,x, - x ^- }r ’°- 

d 2 y dy - „ 

X dx I + dx + 2y ~°' 

. d 2 y , dy 

2x d^ +6 I + y -°- 

.^-x^ + ^ + Dj-o. 


dx 2 dx 


,i d _y 2 


dx 2 ’ dx 


Tj- + (x - 3)y = 0. 


6.3 BESSEL'S EQUATION AND BESSEL FUNCTIONS 
A. Bessel's Equation of Order Zero 

The differential equation 

x 2 + x ^ + (x 2 - p 2 )y = 0, (6.101) 

where p is a parameter, is called Bessel's equation of order p. Any solution of Bessel’s 
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equation of order p is called a Bessel function of order p. Bessel’s equation and Bessel 
functions occur in connection with many problems of physics and engineering, and 
there is an extensive literature dealing with the theory and application of this equation 
and its solutions. 

If p = 0, Equation (6.101) is equivalent to the equation 

x + -t~ + *y = 0, (6.102) 

dx 1 dx 

which is called Bessel's equation of order zero . We shall seek solutions of this equation 
that are valid in an interval 0 < x < R. We observe at once that x = 0 is a regular 
singular point of ( 6 . 102 ); and hence, since we seek solutions for 0 < x < R, we assume a 
solution 


y= £ c » x " +r > 


n = 0 


(6.103) 


where c 0 ^ 0. U pon differentiating (6. 1 03) twice and substituting into ( 6 . 1 02), we obtain 
£ (n + r)(n + r — l)c„x " +r-1 + £ (n + r)c n x n+r ~ l + £ c„x" + r+ 1 = 0 . 

m=0 n = 0 n= 0 

Simplifying, we write this in the form 


£ (« + r) 2 c n x n+r 1 + £ c„_ 2 x n+r 1 = 0 

n = 0 n = 2 


or 

r 2 c 0 x r ~ l + (1 + r) 2 c i x r + £ [(« + r) 2 c„ + c n _ 2 ]x" + r “ 1 = 0. (6.104) 

n-2 

Equating to zero the coefficient of the lowest power of x in (6.104), we obtain the 
indicial equation r 2 = 0, which has equal roots r t = r 2 = 0. Equating to zero the 
coefficients of the higher powers of x in (6.104) we obtain 

(1 + r) 2 c i = 0 (6.105) 

and the recurrence formula 

(n + r) 2 c n + c„- 2 = 0 , n > 2 . (6.106) 

Letting r = 0 in (6.105), we find at once that c t = 0. Letting r = 0 in (6.106) we obtain 
the recurrence formula in the form 

m 2 c w + c„_ 2 = 0 , h > 2 , 

or 


From this we obtain successively 

C 0 Ci . . Co Cq 

C 2 == == ® ( since — 0), C 4 — ^2 = 22 T 42 ’ • • • • 
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We note that all odd coefficients are zero and that the general even coefficient may be 
written 


c (~i)% (-D% 

=• 2 ! -4'6 : |2«| ! |«!| ! 2'-’ ~ 

Letting r = 0 in (6.103) and using these values of c 2 „, we obtain the solution y = ^(x), 
where 


yi(x) = c 0 




2 n 


If we set the arbitrary constant c 0 = 1, we obtain an important particular solution of 
Equation (6.102). This particular solution defines a function denoted by J 0 and called 
the Bessel function of the first kind of order zero. That is, the function J 0 is the particular 
solution of Equation (6.102) defined by 




(„!)> V 2 

Writing out the first few terms of this series solution, we see that 


J 0 (x) = 1 - 


1 


( l !) 2 

2 


, X^ X^ 

~T + 64 


2 J_ 

+ ( 2!) 2 

v -6 


1 


( 3!) 2 


x' 

2 + " 


2304 


(6.107) 


(6.108) 


Since the roots of the indicial equation are equal, we know from Theorem 6.3 that a 
solution of Equation (6.102) which is linearly independent of J 0 must be of the form 

00 

y = * Z c * x " + Jo(*)lnx, 

« = 0 


for 0 < x < R. Also, we know that such a linearly independent solution can be found by 
the method of reduction of order (Section 4.1). Indeed from Theorem 4.7 we know that 
this linearly independent solution y 2 is given by 


and hence by 


j' 2 W = M*) 


e ~t <*x/x 

Uo w ] 2 


dx 


y 2 {x) = J 0 (x) 


dx 

^[•foW] 2 ’ 


From (6.108) we find that 


[ioW ] 2 = l 


x 2 3x 4 5x 6 

T + 1T _ 576 + "‘ 


and hence 

1 x 2 5x 4 23x 6 

[J 0 (x)] 2 = + T + ~i2 + ~5T6 + '" 
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Thus 


y 2 (x) 




x 5x 3 23x 5 

+ 2 + l2 + ~516 + '" 


= J 0 ( X )* n X + 


dx 


, ( t x 2 5x 4 23x 6 

= J ° {x \ \ nx + T + — + — + ■■■ 
/ x 2 X 4 x^ 

= J,Mlnx+ l- T +j 4 - 25 o 5 + 


3X4 11X6 
12S + 13824 + ' ' 


x 2 5x 4 23x 6 

T + 128 + 3456 + " ’ 


We thus obtain the first few terms of the “second” solution y 2 by the method of 
reduction of order. However, our computations give no information concerning the 
general coefficient c\ n in the above series. Indeed, it seems unlikely that an expression 
for the general coefficient can be found. However, let us observe that 


( -’) 2 2w (1)= 44 


(-i) 3 


i 


2 4 ( 2!) 2 


l+^l=- 


(-D 4 


2 6 (3!) 2 


1 +^ + {l = 

2 3 


2 4 • 2 2 • 2 . 128’ 

11 11 


2 6 • 6 2 • 6 13824 


Having observed these relations, we may express the solution y 2 in the following more 
systematic form: 

v-6 


X 2 X 4 

y 2 (x) = *f 0 (x)ln X + J 2 - 2 4 2 2 


l+x) + 


2 6 (3!) 2 


i 1 1 . 

1+ 2 + 3, l + 


Further, we would certainly suspect that the general coefficient cf„ is given by 


c 


* _ 
2fi — 


(-ir 1 

2 2 "(rc !) 2 


i 1 1 1 

1 + X + T + * “ + - 

2 3 n 


n> 1. 


It may be shown (though not without some difficulty) that this is indeed the case. This 
being true, we may express y 2 in the form 


y 2 (x) = d 0 (x)ln x + £ 


(- 1 ) 


n + 1 yin 


2 2 "(n !) 2 


, 1 1 

I T + ~z + 
2 3 


1 

H — 
n 


(6.109) 


Since the solution y 2 defined by (6.109) is linearly independent of J 0 we could write the 
general solution of the differential equation (6.102) as a general linear combination of 
J 0 and y 2 . However, this is not usually done; instead, it has been customary to choose a 
certain special linear combination of J 0 and y 2 and take this special combination 
as the “second” solution of Equation (6.102). This special combination is defined by 


- [j'zW + (7 - In 2)J 0 (x)], 

71 
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where y is a number called Euler's constant and is defined by 


y = lim 


i 11 1 i 

1 + — + — + * * * H In n 

2 3 n 


: 0.5772. 


It is called the Bessel function of the second kind of order zero (Weber’s form) and is 
commonly denoted by Y 0 . Thus the second solution of (6. 1 02) is commonly taken as the 
function Y 0 , where 





J 0 (x ) In x + 


oo 


l 


(-l) n+ 1 X 2n 

2 2 "(n!) 2 


1 1 1 

+ x + r + ' ' ' + - 

2 3 n 


+ (y - In 2)j 0 (x) 


or 


TO 


-§[ 


ln j + v JTO + 


(-1)" +1 X 2 " 

2 2 "(n!) 2 


, 1 1 

1+ 2 + 3 + 


+ ' 


. ( 6 . 110 ) 


Therefore if we choose Y 0 as the second solution of the differential equation (6.102), the 
general solution of (6.102) for 0 < x < R is given by 


y = c l J 0 (x) + c 2 Y 0 (x), (6.111) 

where c l and c 2 are arbitrary constants, and J 0 and Y 0 are defined by (6. 107) and (6. 1 10), 
respectively. 

The functions J 0 and Y 0 have been studied extensively and tabulated. Many of the 
interesting properties of these functions are indicated by their graphs, which are shown 
in Figure 6.1. 


B. Bessel's Equation of Order p 

We now consider Bessel’s equation of order p for x > 0, which we have already 
introduced at the beginning of Section 6.3A, and seek solutions valid for 0 < x < R. 
This is the equation 

x 2 + x ^ + “ p ^ y = (6.101) 



Figure 6.1 
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where we now assume that p is real and positive. We see at once that x = 0 is a regular 
singular point of Equation (6.101); and since we seek solutions valid forO < x < R, we 
may assume a solution 

y= £ c„x" +r , (6.112) 

M = 0 

where c 0 # 0. Differentiating (6.1 12), substituting into (6.101), and simplifying as in our 
previous examples, we obtain 

(r 2 - p 2 )c 0 x r + [(r + l) 2 - p 2 ]c 1 x r+1 

+ z {[(n + f) 2 -p 2 ]c B + c„_ 2 }x' ,+r = 0. (6.113) 

n = 2 

Equating to zero the coefficient of each power of x in (6.113), we obtain 

r 2 -p 2 = 0, (6.114) 

[(r + l) 2 - p 2 ] Cl = 0, (6.115) 

and 

[(n + r) 2 — p 2 ]c n -f c„- 2 — 0, n> 2. (6.116) 

Equation (6. 1 14) is the indicial equation of the differential equation (6. 101). Its roots are 
r l = p > 0 and r 2 = - p. If r t - r 2 = 2p > 0 is unequal to a positive integer, then from 
Theorem 6.3 we know that the differential equation (6.101) has two linearly inde- 
pendent solutions of the form (6.112). However, if r l - r 2 = 2 p is equal to a positive 
integer, we are only certain of a solution of this form corresponding to the larger root 
r i = p. We shall now proceed to obtain this one solution, the existence of which is 
always assured. 

Letting r = r t = p in (6.1 15), we obtain (2 p + 1 )c i = 0. Thus, since p > 0, we must 
have = 0. Letting r = r l = p in (6.1 16), we obtain the recurrence formula 

n(n + 2 p)c n + c„_ 2 = 0, n > 2, 


or 


c 


n 


C n- 2 

n(n + 2pY 


n> 2, 


(6.117) 


corresponding to the larger root p. From this one finds that all odd coefficients are zero 
(since = 0) and that the general even coefficient is given by 


C 2n 


(-1 )'Co 

[2 • 4 • • • {In)-] [(2 + 2p)(4 + 2 p) • {In + 2 p)] 


l)"c 0 

2 2 "n![(l +p)(2 + p) •••(« + ?)]’ 


n > 1. 


Hence the solution of the differential equation (6.101) corresponding to the larger root 
p is given by y = y^x), where 


.Vi(x) = c 0 


l)"x 2 " + p 

2 2 "n! [(1 + p)(2 + p) ••'(« + p)] 


(6.118) 
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If p is a positive integer, we may write this in the form 


= c 0 2 p p\ 


«= o 


(-1 r 

n\(n + p)\ 



(6.119) 


If p is unequal to a positive integer, we need a generalization of the factorial function 
in order to express y^x) in a form analogous to that given by (6.119). Such a gen- 
eralization is provided by the so-called gamma function , which we now introduce. 
For N > 0 the gamma function is defined by the convergent improper integral 


r(N) 


-i: 


? x x N 1 dx. 


( 6 . 120 ) 


If N is a positive integer, it can be shown that 


N\ = F(N + 1). (6.121) 

If N is positive but not an integer, we use (6.121) to define N \ . The gamma function has 
been studied extensively. It can be shown that F(N) satisfies the recurrence formula 

r(N + 1) = NF(N) ( N > 0). (6.122) 

Values of F(N) have been tabulated and are usually given for the range 1 < N < 2. 
Using the tabulated values of F(N) for 1 < N < 2, one can evaluate F(N) for all N > 0 
by repeated application of formula (6.122). Suppose, for example, that we wish to 
evaluate (f)!. From the definition (6.121), we have (f)! = T(f). Then from (6.122), we find 
that r(f) = fr(f). From tables one finds that T(|) % 0.8862, and thus (f)! = r(f) « 
1.3293. 

For N < 0 the integral (6.120) diverges, and thus T(N) is not defined by (6.120) for 
negative values of N. We extend the definition of T(N) to values of N < 0 by 
demanding that the recurrence formula (6.122) hold for negative (as well as positive) 
values of N. Repeated use of this formula thus defines F(N) for every nonintegral 
negative value of N . 

Thus F(N) is defined for all N ^ 0, - 1, —2, -3,.... The graph of this function is 
shown in Figure 6.2. We now define N\ for all N ^ —1, —2, —3,... by the formula 
(6.121). 

We now return to the solution defined by (6. 1 1 8), for the case in which p is unequal 
to a positive integer. Applying the recurrence formula (6.122) successively with N = 
n + p, n + p — 1, n + p — 2, . . . , p + 1, we obtain 

r(n + p + 1) = (n + p)(n + p - 1) • • • (p + l)r(p + 1). 

Thus for p unequal to a positive integer we may write the solution defined by (6.1 18) in 
the form 


hW = c«r(p + i) 


oo (_ 1)nx 2 n + P 

„h 2 2 "n!r(n + p + 1) 


= c 0 2T(p+ 1) £ 

n = 0 


(- 1 )" 

n!r(n + p+ 1) 



(6.123) 


Now using (6. 121) with N = p and ;V = n + p, we see that (6. 123) takes the form (6. 119). 
Thus the solution of the differential equation (6.101) corresponding to the larger 
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root p > 0 is given by (6.119), where p\ and {n + p)\ are defined by T(p + 1) and 
r(n + p + 1), respectively, if p is not a positive integer. 

If we set the arbitrary constant c 0 in (6.1 19) equal to the reciprocal of 2 P p\, we obtain 
an important particular solution of (6.101). This particular solution defines a function 
denoted by J p and called the Bessel function of the first kind of order p. That is, the 
function J p is the particular solution of Equation (6.101) defined by 


oo 

Jp{X) = „?o n!(n + p)\ 



(6.124) 


where (n + p)\ is defined by T(n + p + 1) if p is not a positive integer. 

Throughout this discussion we have assumed that p > 0 in (6.101) and hence that 
p > 0 in (6. 124). If p = 0 in (6.101), then (6.101) reduces to the Bessel equation of order 
zero given by (6.102) and the solution (6.124) reduces to the Bessel function of the first 
kind of order zero given by (6.107). 

If p = 1 in (6.101), then Equation (6.101) becomes 



+ x -j— + (x 2 — \)y = 0, 
dx 


(6.125) 


which is Bessel’s equation of order one. Letting p = 1 in (6.124) we obtain a solution of 
Equation (6.125) that is called the Bessel function of the first kind of order one and is 
denoted by J l . That is, the function J t is defined by 


Ji(x)= I 


(- 1 )" 
n!(n + 1)! 


2 »+ 1 


(6.126) 
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The graphs of the functions J 0 , defined by (6.107), and J u defined by (6.126), are shown 
in Figure 6.3. 

Several interesting properties of the Bessel functions of the first kind are suggested 
by these graphs. For one thing, they suggest that J 0 and J i each have a damped oscilla- 
tory behavior and that the positive zeros of J 0 and J t separate each other. This is indeed 
the case. In fact, it may be shown that for every p > 0 the function J p has a damped 
oscillatory behavior as x -► oo and the positive zeros of J p and J p+l separate each other. 

We now know that for every p > 0 one solution of Bessel’s equation of order p 
(6.101) is given by (6.124). We now consider briefly the problem of finding a linearly 
independent solution of (6.101). We have already found such a solution for the case in 
which p = 0; it is given by (6. 110). Forp > 0, we have observed that if 2p is unequal to a 
positive integer, then the differential equation (6.101) has a linearly independent 
solution of the form (6.112) corresponding to the smaller root r 2 = —p. We now 
proceed to work with this smaller root. 

Letting r = r 2 = — p in (6.1 15), we obtain 

( — 2p + 1 )c t =0. (6.127) 

Letting r = r 2 = —p in (6.1 16) we obtain the recurrence formula 

n(n - 2p)c n + c„_ 2 = 0, n > 2, (6.128) 


or 


c 


n 


C n- 2 

n(n — 2 p) ’ 


n> 2, n # 2p. 


(6.129) 


Using (6.127) and (6.128) or (6.129) one finds solutions y = y 2 (x), corresponding to the 
smaller root — p. Three distinct cases occur, leading to solutions of the following forms: 


1. If 2p / a positive integer, 

y 2 (x) = C 0 X~ P [ 1 + ^ a 2 nX 2n ), 


n = 1 


(6.130) 


where c 0 is an arbitrary constant and the oc 2n (n = 1, 2, . . .) are definite constants. 
2. If 2 p = an odd positive integer, 


y 2 (x) = c 0 x p 1 + Z P 2 n* 2 " + ^ 2 pX p 1 + X y 2n 


n = 1 


n = 1 


(6.131) 



Figure 6.3 
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where c 0 and c 2p are arbitrary constants and /? 2 „ and y 2n (n = 1,2,...) are definite 
constants. 

3. If 2 p = an even positive integer, 

yi(x) = c 2p x p (l + f <5 2 „x 2 "^, (6.132) 

where c 2p is an arbitrary constant and the d 2n (n = 1, 2,. . .) are definite constants. 

In Case 1 the solution defined by (6.130) is linearly independent of J p . In Case 2 the 
solution defined by (6.131) with c 2p = 0 is also linearly independent of J p . However, in 
Case 3 the solution defined by (6.132) is merely a constant multiple of J p (x ), and hence 
this solution is not linearly independent of J p . Thus if 2 p is unequal to an even positive 
integer, there exists a linearly independent solution of the form (6.112) corresponding 
to the smaller root —p. In other words, if p is unequal to a positive integer, the 
differential equation (6.101) has a solution of the form y = y 2 (x), where 


00 


J'zW = Z C 2 n x 2n p , 

n = 0 


(6.133) 


and this solution y 2 is linearly independent of J p . 

It is easy to determine the coefficients c 2n in (6.133). We observe that the recurrence 
formula (6.129) corresponding to the smaller root —p is obtained from the recurrence 
formula (6.117) corresponding to the larger root p simply by replacing p in (6.117) by 
-p. Hence a solution of the form (6.133) may be obtained from (6.124) simply by 
replacing p in (6.124) by —p. This leads at once to the solution denoted by J_ p and 
defined by 


J-p(x) 


* (- 1 )” 

> 

= o n\(n - p)\ 



(6.134) 


where (n — p)\ is defined by T(n — p + 1). 

Thus if p > 0 is unequal to a positive integer, two linearly independent solutions of 
the differential equation (6.101) are J p defined by (6.124), and‘J_ p , defined by (6.134). 
Hence, if p > 0 is unequal to a positive integer, the general solution of Bessel’s equation 
of order p is given by 

y = C x J p (x) + C 2 J_ p (x), 

where J p and J_ p are defined by (6.124) and (6.134), respectively, and C { and C 2 are 
arbitrary constants. 

If p is a positive integer, the corresponding solution defined by (6.123) is not linearly 
independent of J p , as we have already noted. Hence in this case a solution that is 
linearly independent of J p must be given by y = y p (x), where 

X 

y,(x) = x~ p Z c * x " + CJpMln X, 

n = 0 


where C # 0. Such a linearly independent solution y p may be found by the method of 
reduction of order. Then the general solution of the differential equation (6. 101) may be 
written as a general linear combination of J p and y p . However, as in the case of Bessel’s 
equation of order zero, it is customary to choose a certain special linear combination of 
J p and y p and take this special combination as the “second” solution of Equation 
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+ 


(6.101). This special combination is denoted by Y p and defined by 

<6 - i35) 

where y is Euler’s constant. The solution Y p is called the Bessel function of the second 
kind of order p (Weber’s form). 

Thus if p is a positive integer, two linearly independent solutions of the differential 
equation (6.101) are J p9 defined by (6.124), and Yp, defined by (6.135). Hence if p is a 
positive integer, the general solution of Bessel’s equation of order p is given by 

y = c^pix) + c 2 y p (x), 

where J p and Y p are defined by (6.124) and (6.135), respectively, and C l and C 2 are 
arbitrary constants. 

Exercises 

1. Show that J 0 (kx) 9 where k is a constant, satisfies the differential equation 

d2 y , d y , i,2. 


x m ~r ~2 H" ~f~ + k 2 xy = 0. 
dx 2 dx 


2. Show that the transformation 


y = 


u(x) 


reduces the Bessel equation of order p, Equation (6.101), to the form 
d 2 u 


dx 2 


+ 


_■ + (3 - p2 ) ^] u = a 


3. Use the result of Exercise 2 to obtain a solution of the Bessel equation of order 

4. Using the series definition (6.124) for J p9 show that 

d 


dx 


Vx p J p (kx)']=kx p J p . l {kx) 


and 


dx 


[x p J p {kx)] = -kx p J p + l (kx) 9 


where k is a constant. 

5. Use the results of Exercise 4 to show that 

d 


dx [Jp(fcx)] = kJ p -i(kx) - V - J p {kx), 


[Jp(M] = ~kJ p+l (kx) + V - J p (kx). 
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Hence show that 


lJ p (kx)~\ =^lJ p -i(kx) - J p+1 (/cx)], 
lex 

— + J p+l (kx)~\. 

6. Using the results of Exercise 5, 

(a) express J t (x) and [^(x)] in terms of J 0 (x) and J 2 (x)' 9 

ax 

(b) express J n+ 1/2 (x) in terms of J„_ 1/2 (x) and J n - 3/2 (x). 



CHAPTER SEVEN 

Systems of Linear Differential Equations 


In the previous chapters we have been concerned with one differential equation in one 
unknown function. Now we shall consider systems of two differential equations in two 
unknown functions, and more generally, systems of n differential equations in n 
unknown functions. We shall restrict our attention to linear systems only, and we shall 
begin by considering various types of these systems. After this, we shall proceed to 
introduce differential operators, present an operator method of solving linear systems, 
and then consider some basic applications of this method. We shall then turn to a study 
of the fundamental theory and basic method of solution for a standard type of linear 
system in the special case of two equations in two unknown functions. Following this, 
we shall outline the most basic material about matrices and vectors. We shall then 
present the basic method for the standard type of linear system by means of matrices 
and vectors, first in the special case of two equations in two unknown functions and 
then in the more general case of n equations in n unknown functions. The fundamental 
theory for this general case will be presented in Chapter 1 1. 


7.1 DIFFERENTIAL OPERATORS AND AN OPERATOR METHOD 


A. Types of Linear Systems 


We start by introducing the various types of linear systems that we shall consider. The 
general linear system of two first-order differential equations in two unknown 
functions x and y is of the form 


dx dy 

a i(t) + a ^)-j t + a i(t)x + a 4 (t)y = F,(t), 

bi (r) ^ + b 2 (t) j- + b 3 (t)x + b A (t)y = F 2 (t). 


264 
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We shall be concerned with systems of this type that have constant coefficients. An 
example of such a system is 


2 


dx 

It 



— 2 x + y = t 2 , 


dx_ 2 ^l 

dt dt 


+ 3x + Ay = 


e\ 


We shall say that a solution of system (7.1) is an ordered pair of real functions ( f g) 
such thatx = f(t),y = g(t) simultaneously satisfy both equations of the system (7.1) on 
some real interval a < t < b. 

The general linear system of three first-order differential equations in three unknown 
functions x, y, and z is of the form 


+ dt + a *^ x + a ^ y + a ^ z = Fl ^’ 
dx dy dz 

bl ^~dt + bl ^~dt + b ^dt + b ^ X + bs ^ y + b(, ^ Z = 
dx dy dz 

Ci(t) — + C 2 (r) — c 3 (t)^- + C 4 (t)x + c 5 (t)y + c 6 (t)z = F 3 (t). 


(7.2) 


As in the case of systems of the form (7.1), so also in this case we shall be concerned with 
systems that have constant coefficients. An example of such a system is 


dx dy ^ dz 
— + - 2 — + 2x 

dt dt dt 


3y + z = t, 


- dx dy _ dz 

2 — — b 3 — — b x + 4 y — 5z — sin t, 

dt dt dt 

dx ^ dy dz 

— — b 2 — — I — 3x -b 2y — z — cos t. 

dt dt dt 


We shall say that a solution of system (7.2) is an ordered triple of real functions (/, g , h) 
such that x = f(t), y = g(t ), z = h(t) simultaneously satisfy all three equations of the 
system (7.2) on some real interval a < t < b. 

Systems of the form (7.1) and (7.2) contained only first derivatives, and we now 
consider the basic linear system involving higher derivatives. This is the general linear 
system of two second-order differential equations in two unknown fucntions x and y, 
and is a system of the form 

d 2 x d 2 y dx dy 

a lW-^T + a 2(t)-^2+ MO^ + + = f lW’ 

MO^pr + + Mf)~~ + MO + MO* + b 6 (t)y = F 2 (t). 


We shall be concerned with systems having constant coefficients in this case also, and 
an example is provided by 


d 2 x 


■ d*y 

dt 2 


, dx 
dt 


2 -fcT + 5 172 +7 ~ + 3 ^7 + 2y = 3t + 1, 


dy 

dt 


3 


d 2 x 

dt 2 




+ 4x -b y = 0. 
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For given fixed positive integers m and n, we could proceed, in like manner, to exhibit 
other general linear systems of n mth-order differential equations in n unknown 
functions and give examples of each such type of system. Instead we proceed to 
introduce the standard type of linear system referred to in the introductory paragraph 
at the start of the chapter, and of which we shall make a more systematic study later. 
We introduce this standard type as a special case of the system (7.1) of two first-order 
differential equations in two unknowns functions x and y. 

We consider the special type of linear system (7.1), which is of the form 

dx 

= a n (t)x + a 12 (t)y + F { (t), 


dt 
dy 

— = a 2l (t)x + a 22 (t)y + F 2 (t). 


(7.4) 


This is the so-called normal form in the case of two linear differential equations in two 
unknown functions. The characteristic feature of such a system is apparent from the 
manner in which the derivatives appear in it. An example of such a system with variable 
coefficients is 

dx , , 

— = t 2 x + {t + l)y + t\ 

% = te'x + t 3 y - e‘, 
dt 


while one with constant coefficients is 

dx 


dt 


= 5x + ly + r 


^ = 2x — 3 y + 2 1. 
dt 


The normal form in the case of a linear system of three differential equations in three 
unknown functions x, y, and z is 


dx 

li 


= a n (t)x + a i2 {t)y + a i3 (t)z + F^t). 


dy 

— = a 2l (t)x + a 22 (t)y + a 23 (t)z + F 2 (t), 


dz 

di 


= <*31 W* + a3z(t)y + azz(t)z + F 2 (t). 


An example of such a system is the constant coefficient system 

dx 


dt 


= 3x + 2y + z t, 


dy 

— = 2x — Ay + 5z — t 2 , 
dt 


dz 

— = 4x + v — 3z + 2t + 1. 
dt 
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The normal form in the general case of a linear system of n differential equations in n 
unknown functions x t , x 2 , . . . , x„ is 

dx x 

— = a ll (t)x l + a 12 (t)x 2 + • • • + a ln (t)x„ + F x (t), 

dXy 

= a 2l(t) X l + ^2l( t ) X 2 + * ‘ ‘ + tX 2n {t)X n + ( 7 - 5 ) 


dXn 

dt 


= a nl (t)x l + a n2 (t)x 2 + ■■■ + a„„(t)x n + F„{t). 


An important fundamental property of a normal linear system (7.5) is its relationship 
to a single nth-order linear differential equation in one unknown function. Specifically, 
consider the so-called normalized (meaning, the coefficient of the highest derivative is 
one) nth-order linear differential equation 

d n x d n — * x dx 

+ a i (0 + ' ' ' + a«- 1 (0 + a n(t) x = F(t) (7.6) 


in the one unknown function x. Let 


dx 



X 3 


d 2 x 


_d n ~ 2 x 
~ dt n ~ 2 9 


d n ~ l x 
dt n ~ l * 


From (7.7), we have 

dx dx x 
dt ~ dt 9 


d 2 x dx 2 d n 1 dx n ^ { 

dt 2 dt dt n ~ l dt 


d n x dx n 
~dF = ~ch' 


(7.7) 


(7.8) 


Then using both (7.7) and (7.8), the single nth-order equation (7.6) can be transformed 
into 


dx , 

-dT = x >’ 


dx 2 

j = X 3 , 

dt 


** 0-1 r 
dt 

dx 

-77= -a n (t)xi - a„_,( t)x 2 - ■■■ - a l (t)x„ + F(t), 


(7.9) 


which is a special case of the normal linear system (7.5) of n equations in n unknown 
functions. Thus we see that a single nth-order linear differential equation of form (7.6) is 
one unknown function is indeed intimately related to a normal linear system (7.5) of n 
first-order differential equation in n unknown functions. 
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B. Differential Operators 

In this section we shall present a symbolic operator method for solving linear systems 
with constant coefficients. This method depends upon the use of so-called differential 
operators , which we now introduce. 

Let x be an rc-times differentiable function of the independent variable t. We denote 
the operation of differentiation with respect to t by the symbol D and call D a 
differential operator. In terms of this differential operator the derivative dx/dt is 
denoted by Dx. That is, 

Dx = dx/dt . 

In like manner, we denote the second derivative of x with respect to t by D 2 x. Extending 
this, we denote the nth derivative of x with respect to t by D n x. That is, 

d n x 

D " x = -^r = 

Further extending this operator notation, we write 

, ^ , dx 

(D + c)x to denote — + cx 
dt 


and 


/j n y /j m v 

(aD n + bD m )x to denote a — — + b — 

dt dt 


where a, b , and c are constants. 

In this notation the general linear differential expression with constant coefficients 
Oq, a i, . . . , a n - 1 , a n , 


d n x d n l x dx 

h a i r- 4- • • • 4* i h a„x, 

0 dt n 1 dt n ~ i n i dt n 


is written 

(a 0 D n + a x D n ~ l + •■ ’ + a n - x D + a n )x. 

Observe carefully that the operators D n , D n ~ . . . , D in this expression do not represent 
quantities that are to be multiplied by the function x, but rather they indicate operations 
(differentiations) that are to be carried out upon this function. The expression 

a$D n + a i D n 1 + + + 

by itself, where a 0 , a l9 . . . , l9 a n are constants, is called a linear differential operator 
with constant coefficients. 


► Example 7.1 

Consider the linear differential operator 

3D 2 + 5D — 2. 
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If x is a twice differentiable function of t 9 then 

d 2 x dx 

(3D 2 + 5 D — 2)x denotes 3 — T + 5 — 2x. 

dt 2 dt 

For example, if x = t 3 , we have 

(3D 2 + 5D - 2)t 3 = 3-L (f 3 ) + 5 j (t 3 ) - 2(t 3 ) = 18r + 15r 2 — 2t\ 

We shall now discuss certain useful properties of the linear differential operator with 
constant coefficients. In order to facilitate our discussion, we shall let L denote this 
operator. That is, 

L = a 0 D n + D n 1 -f • • • 4- a n - 1 D -+- a n , 

where a 0 , a { , . . . , a n _ { , a n are constants. Now suppose that and f 2 are both n-times 
differentiable functions of t and c { and c 2 are constants. Then it can be shown that 

^[ C l/l + C lfl\ ~ C l^[/l] + 

For example, if the operator L = 3D 2 + 5D — 2 is applied to 3 1 2 + 2 sin t , then 
L[3t 2 + 2 sin r] = 3 L[r 2 ] + 2L[sin t] 


or 

(3D 2 + 5 D - 2)(3r 2 + 2 sin t) = 3(3D 2 + 5 D- 2 )t 2 + 2(3D 2 + 5 D- 2)sin t. 
Now let 

Li = a 0 D m + a l D m 1 + * * * + a m _ X D + a m 
and 


L 2 = b 0 D" + b l D"~ i + •• 

' + b„~ iD + b„ 


be two linear differential operators with constant coefficients a 0 ,a l ,.. 
b 0 ,b l ,...,b n - l ,b n , respectively. Let 

a m -i, fl m ,and 

L^r) = a 0 r m + a 1 r m 1 + • 

•• + flm-i r + a m 


and 



L 2 (r) = b 0 r n + b 1 r"- 1 + • 

'• + K-i r + b„ 



be the two polynomials in the quantity r obtained from the operators L { and L 2 , 
respectively, by formally replacing D by r, D 2 by r 2 ,..., D k by r\ Let us denote the 
product of the polynomials L { (r) and L 2 (r) by L(r); that is, 

Ur) ) = L x (r)LM 

Then, if / is a function possessing n + m derivatives, it can be shown that 

L x L 2 f = L 2 LJ = Lf 9 (7.10) 

where L is the operator obtained from the “product polynomial” L(r) by formally 
replacing r by D, r 2 by D 2 , . . . , r m+n by D m+n . Equation (7.10) indicates two important 
properties of linear differential operators with constant coefficients. First, it states the 
effect of first operating on / by L 2 and then operating on the resulting function by L i is 
the same as that which results from first operating on / by and then operating on 
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this resulting function by L 2 . Second, Equation (7.10) states that the effect of first 
operating on / by either or L 2 and then operating on the resulting function by the 
other is the same as that which results from operating on / by the “product operator” L. 
We illustrate these important properties in the following example. 


► Example 7.2 

Let Li = D 2 + 1, L 2 = 3D + 2, f(t) = t 3 . Then 

L.LJ = (D 2 + 1)(3D + 2 )t 3 = (D 2 + l)(9t 2 + 2 t 3 ) 

= 9 (D 2 + \)t 2 + 2 (D 2 + 1 )t 3 
= 9(2 + t 2 ) + 2(6 1 + t 3 ) = 2 1 3 + 9t 2 + 12r + 18 
and 

L 2 LJ = (3D + 2)(D 2 + 1 )t 3 = (3D + 2)(6 1 + t 3 ) 

= 6(3 D + 2)t + (3D + 2)r 3 
= 6(3 + 2 1) + (9 1 2 + 2t 3 ) = 2t 3 + 9t 2 + 12t + 18. 

Finally, L = 3D 3 + 2D 2 + 3D + 2 and 

L/ = (3D 3 + 2D 2 + 3D + 2)t 3 = 3(6) + 2(6t) + 3(3t 2 ) + 2 1 3 
= 2t 3 + 9t 2 + 12t + 18. 

Now let L = a 0 D n + a l D n ~ 1 + ••• + a n - x D + a w , where a 0 , a i9 ... 9 a n - 19 a n are 
constants; and let L(r) = a 0 r n + air” -1 + • • ■ + a„_ x r + a n be the polynomial in r 
obtained from L by formally replacing D by r, D 2 by r 2 , . . . , D" by r". Let r i? r 2 , . . . , r n be 
the roots of the polynomial equation L(r) = 0. Then L(r) may be written in the factored 
form 

Uf) = a 0 (r - r x )(r - r 2 )-- (r - r„). 

Now formally replacing r by D in the right member of this identity, we may express the 
operator L = a 0 D” + a x D n ~ 1 + • • • + a n _ { D + a„ in the factored form 

L = a 0 (D - r X )(D - r 2 ) *(D - r„). 

We thus observe that linear differential operators with constant coefficients can be 
formally multiplied and factored exactly as if they were polynomials in the algebraic 
quantity D. 


C. An Operator Method for Linear Systems with Constant Coefficients 

We now proceed to explain a symbolic operator method for solving linear systems with 
constant coefficients. We shall outline the procedure of this method on a strictly formal 
basis and shall make no attempt to justify it. 

We consider a linear system of the form 

L,x + L 2 y = f,(t), 

L 3 x + L 4 y = f 2 (t), ( 7 ' n * 

where L 1? L 2 , L 3 , and L 4 are linear differential operators with constant coefficients. 
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That is, Lj, L 2 , L 3 , and L 4 are operators of the forms 

Li = a 0 D m -f a x D m 1 + ••• + a m -\D + 

L 2 = b 0 D n + b l D n 1 + b n -iD + b n9 

L 3 = 0CqD p + oc^D p 1 + ■ * ■ + &p- \D -f ct p9 

L4. = PoD q + PlD q ~ 1 + * * * + P q - iD + P q9 

where the a’s, b% a’s, and /Ts are constants. 

A simple example of a system which may be expressed in the form (7.1 1) is provided 
by 


^ dx dy 

2 — — 2 — — 3 x = t 9 
dt dt 


^ dx ^ dy 

2 — — K 2 — — f- 3x + 8y = 2. 
dt dt 

Introducing operator notation this system takes the form 

(2Z> - 3)x - 2 Dy = t 9 
(2D + 3)x + (2D + 8 )y = 2. 

This is clearly of the form (7.11), where L i =2D — 3, L 2 = —2D, L 3 = 2D + 3, and 
L 4 = 2D + 8. 

Returning now to the general system (7.11), we apply the operator L 4 to the first 
equation of (7.11) and the operator L 2 to the second equation of (7.11), obtaining 

L 4 L x x + L Ar L 2 y = Wi, 

L 2 L 3 x + T 2 L 4 y = L 2 / 2 . 

We now subtract the second of these equations from the first. Since L 4 L 2 y = L 2 L 4 y, 
we obtain 

L 4 LiX — L 2 L 3 x = L 4 /l — T 2 / 2 , 


or 


(LiL* - L 2 L 3 )x = L 4 / x - L 2 / 2 . (7.12) 

The expression L t L 4 - L 2 L 3 in the left member of this equation is itself a linear 
differential operator with constant coefficients. We assume that it is neither zero nor a 
nonzero constant and denote it by L 5 . If we further assume that the functions 
and f 2 are such that the right member L^f x - L 2 f 2 of (7.12) exists, then this member 
is some function, say g i9 of t. Then Equation (7.12) may be written 

L 5 x = Ql . (7.13) 

Equation (7.13) is a linear differential equation with constant coefficients in the single 
dependent variable x. We thus observe that our procedure has eliminated the other 
dependent variable y. We now solve the differential equation (7.13) for x using the 
methods developed in Chapter 4. Suppose Equation (7.13) is of order N. Then the 
general solution of (7.13) is of the form 


x = c l u l + c 2 u 2 + • • • + c N u N + U l9 


(7.14) 
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where u l9 u 2 ,...,u N are N linearly independent solutions of the homogeneous linear 
equation L 5 x = 0, c l9 c 2 , . . . , c N are arbitrary constants, and U x is a particular solution 
of L 5 x = g { . 

We again return to the system (7.1 1) and this time apply the operators L 3 and to 
the first and second equations, respectively, of the system. We obtain 


L 3 L,x + L 3 L 2 y= LJi, 

L1L3X + L l L A y = L 1 f 1 . 

Subtracting the first of these from the second, we obtain 

(L 1 L 4 -L 2 L 3 )y = L 1 / 2 -L 3 / 1 . 

Assuming that and f 2 are such that the right member L { f 2 — L 3 / t of this equation 
exists, we may express it as some function, say g 2 , of t. Then this equation may be 
written 


L 5 y = g 2 , (7.15) 

where L 5 denotes the operator LjL 4 - L 2 L 3 . Equation (7.15) is a linear differential 
equation with constant coefficients in the single dependent variable y. This time we 
have eliminated the dependent variable x. Solving the differential equation (7.15) for y, 
we obtain its general solution in the form 

y = k^iii + k 2 u 2 + ■” + k N u N + U 2 , (7.16) 

where u l9 w 2 , . . . , u N are the N linearly independent solutions of L 5 y = 0 (or L 5 x = 0) 
that already appear in (7.14), k u k 2 ,..., k N are arbitrary constants, and U 2 is a 
particular solution of L 5 y = g 2 . 

We thus see that if x and y satisfy the linear system (7.1 1), then x satisfies the single 
linear differential equation (7.13) and y satisfies the single linear differential equation 
(7.15). Thus if xand y satisfy the system (7.11), then x is of the form (7.14) and y is of the 
form (7.16). However, the pairs of functions given by (7.14) and (7.16) do not satisfy the 
given system (7. 1 1) for all choices of the constants Ci,c 2 ,...,c N ,k l ,k 2 ,...,k N . That is, 
these pairs (7.14) and (7.16) do not simultaneously satisfy both equations of the given 
system (7.11) for arbitrary choices of the 2 N constants c l9 c 2 ,.. ., c N , k l9 fc 2 ,. k N . In 
other words, in order for x given by (7.14) and y given by (7.16) to satisfy the given 
system (7.11), the 2 N constants c l 9 c 29 ... 9 c N 9 k l 9 k 2 ,... 9 k N cannot all be independent 
but rather certain of them must be dependent on the others. It can be shown that the 
number of independent constants in the so-called general solution of the linear system 
(7. 11) is equal to the order of the operator L { L 4 — L 2 L 3 obtained from the determinant 

Li L 2 

l 3 l 4 

of the operator “coefficients” of x and y in (7.1 1), provided that this determinant is not 
zero. We have assumed that this operator is of order N. Thus in order for the pair (7. 14) 
and (7.16) to satisfy the system (7.11) only N of the 2 N constants in this pair can 
be independent. The remaining N constants must depend upon the N that are 
independent. In order to determine which of these 2 N constants may be chosen as 
independent and how the remaining N then relate to the N so chosen, we must 
substitute x as given by (7.14) and y as given by (7.16) into the system (7.1 1). This de- 
termines the relations that must exist among the constants c l ,c 2 ,...,c N ,k l ,k 2 ,...,k N 
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in order that the pair (7.14) and (7.16) constitute the so-called general solution of 
(7.1 1). Once this has been done, appropriate substitutions based on these relations are 
made in (7.14) and/or (7.16) and then the resulting pair (7.14) and (7.16) contain the 
required number N of arbitrary constants and so does indeed constitute the so-called 
general solution of system (7.11). 

We now illustrate the above procedure with an example. 


(7.17) 


► Example 7.3 

Solve the system 

- dx dy 

2 T" + 2 + 3x + Sy = 2. 

dt dt 

We introduce operator notation and write this system in the form 

(2D - 3)x - 2 Dy = t, 

(2D + 3)x + (2D + S)y = 2. 

We apply the operator (2D + 8) to the first equation of (7. 1 8) and the operator 2D to the 
second equation of (7.18), obtaining 

(2D + 8)(2 D - 3)x - (2D + 8)2 Dy = (2D + 8)f, 

2D(2D + 3)x + 2D(2D + 8 )y = (2D)2. 

Adding these two equations, we obtain 

[(2D + 8)(2D - 3) + 2D(2D + 3)]x = (2D + 8 )t + (2D)2 


(7.18) 


or 


(8D 2 + 16D - 24)x = 2 + St + 0 


or, finally 

(D 2 + 2D - 3)x = t + i 

The general solution of the differential equation (7.19) is 

x = c 1 e‘ + c 2 e- 3 ' -jt -U. 


(7.19) 


(7.20) 


We now return to the system (7. 1 8) and apply the operator (2D + 3) to the first equation 
of (7.18) and the operator (2D — 3) to the second equation of (7.18). We obtain 

(2D + 3)(2D - 3)x - (2D + 3)2Dy = (2D + 3 )t, 

(2D - 3)(2D + 3)x + (2D - 3)(2D + 8 )y = (2D - 3)2. 

Subtracting the first of these equations from the second, we have 

[(2D - 3)(2D + 8) + (2D + 3)2D]y = (2D - 3)2 - (2D + 3)t 

or 


(8 D 2 + 16D - 24)y = 0- 6- 2- 3t 
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or, finally, 


( D 2 + 2D - 3 )y = -f f - 1. 


The general solution of the differential equation (7.21) is 


y = k l e t + k 2 e 3t + it + 


(7.21) 

(7.22) 


Thus if x and y satisfy the system (7. 17), then x must be of the form (7.20) and y must 
be of the form (7.22) for some choice of the constants c l ,c 2 ,k l ,k 2 . The determinant of 
the operator “coefficients” of x and y in (7.18) is 


2D - 3 
2D + 3 


-2D 
2D + 8 


= 8D 2 + 16 D - 24. 


Since this is of order 2, the number of independent constants in the general solution of 
the system (7.17) must also be two. Thus in order for the pair (7.20) and (7.22) to satisfy 
the system (7.17) only two of the four constants c l ,c 2 ,k l , and k 2 can be independent. In 
order to determine the necessary relations that must exist among these constants, we 
substitute x as given by (7.20) and y as given by (7.22) into the system (7.17). 
Substituting into the first equation of (7.17), we have 

[2 c x e l — 6 c 2 e~ 3t — f ] — [2/^e* - 6k 2 e~ 3t + i] - [3 c x c' + 3c 2 e~ 3t — t — = t 

or 

(— - 2k l )e t + ( — 9c 2 + 6k 2 )e~ 3t = 0. 


Thus in order that the pair (7.20) and (7.22) satisfy the first equation of the system (7.17) 
we must have 


-c l - 2k { = 0, 
-9c 2 + 6/c 2 = 0. 


(7.23) 


Substitution of x and y into the second equation of the system (7.17) will lead to 
relations equivalent to (7.23). Hence in order for the pair (7.20) and (7.22) to satisfy the 
system (7.17), the relations (7.23) must be satisfied. Two of the four constants m (7.23) 
must be chosen as independent. If we choose c x and c 2 as independent, then we have 


ki = —jc i and k 2 = |c 2 . 


Using these values for k i and k 2 in (7.22), the resulting pair (7.20) and (7.22) constitute 
the general solution of the system (7.17). That is, the general solution of (7.17) is given 
by 

x = c 1 e‘ + c 2 e~ 3 ' 

y = -jCie‘ + fc 2 e~ 31 + it + -h. 


where c { and c 2 are arbitrary constants. If we had chosen k { and k 2 as the independent 
constants in (7.23), then the general solution of the system (7.17) would have been 
written 
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An Alternative Procedure. Here we present an alternative procedure for solving 
a linear system of the form 

L x x + L 2 y = fft), 

L 3 x + L A y = f 2 (t), (7J1) 

where L u L 2 , L 3 , and L 4 are linear differential operators with constant coefficients. 
This alternative procedure begins in exactly the same way as the procedure already 
described. That is, we first apply the operator L 4 to the first equation of (7.1 1) and the 
operator L 2 to the second equation of (7.1 1), obtaining 

L 4 LiX + L 4 L 2 y = L 4 / 1? 

^2^3* "1" ^2^4.V = ^ 2 / 2 * 

We next subtract the second from the first, obtaining 

(L t L 4 - L 2 L 3 )x = L 4 /i - L 2 / 2 , (7.12) 

which, under the same assumptions as we previously made at this point, may be written 

L 5 x = g { . (7.13) 

Then we solve this single linear differential equation with constant coefficients in the 
single dependent variable x. Assuming its order is N, we obtain its general solution in 
the form 

X = C^Ui -h C 2 U 2 + *** + “b Ui, (7.14) 

where u u u 2 ,..., u N are N linearly independent solutions of the homogeneous linear 
equation L 5 x = 0,c 1 ,c 2 ,...,c N are N arbitrary constants, and U { is a particular 
solution of L 5 x = g { . 

Up to this point, we have indeed proceeded just exactly as before. However, we now 
return to system (7.11) and attempt to eliminate from it all terms that involve the 
derivatives of the other dependent variable y . In other words, we attempt to obtain 
from system (7.11) a relation R that involves the still unknown y but none of the 
derivatives of y. This relation R will involve x and/or certain of the derivatives of x; but 
x is given by (7.14) and its derivatives can readily be found from (7.14). Finding these 
derivatives of x and substituting them and the known x itself into the relation R , we see 
that the result is merely a single linear algebraic equation in the one unknown y . 
Solving it, we thus determine y without the need to find (7. 1 5) and (7. 1 6) or to relate the 
arbitrary constants. 

As we shall see, this alternative procedure always applies in an easy straightforward 
manner if the operators L u L 2 , L 3 , and L 4 are all of the first order. However, for 
systems involving one or more higher-order operators, it is generally difficult to 
eliminate all the derivatives o £y. 

We now give an explicit presentation of the procedure for finding y when L { , L 2 , L 3 , 
and L 4 are all first-order operators. 

Specifically, suppose 

L 1 = a$D -f- a± , 
i-»2 = b 0 D + bi , 

L 3 = ot 0 D + a l5 
Lt = p 0 D + p x . 
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Then (7.11) is 

(a 0 D + a^x + ( b 0 D + b t )y = f^t), 
(a 0 D + a x )x + (P 0 D + Pi)y = f 2 (t). 


(7.24) 


Multiplying the first equation of (7.24) by [l 0 and the second by — b 0 and adding, we 
obtain 


[(Mo - b 0 a 0 )D + ( aj 0 - Mi)]* + (bj 0 - Mi))' = P 0 fi(t) - b 0 f 2 (t). 

Note that this involves y but none of the derivatives of y. From this, we at once obtain 

_ (Mo ~ Mo)P* + (Mi ~ Mo)* + PofM ~ Kfi (0 n 

^ b 1 p 0 -b 0 p i ’ } 

assuming b l f 0 — b 0 # 0. Now x is given by (7.14) and Dx may be found from (7.14) 
by straightforward differentiation. Then substituting these known expressions for x 
and Dx into (7.25), we at once obtain y without the need of obtaining (7.15) and (7.16) 
and hence without having to determine any relations between constants c { and 
ki(i = 1,2,..., N), as in the original procedure. 

We illustrate the alternative procedure by applying it to the system of Example 7.3. 


► Example 7.4 


Solve the system 

^ dx dy 

2— 2 — — 3 x = t, 

dt dt 

dx ^ dy 

2 — — h 2 — — h 3x + 8y = 2. 
dt dt 


of Example 7.3 by the alternative procedure that we have just described. 

Following this alternative procedure, we introduce operator notation and write the 
system (7.17) in the form 

(2D - 3)x - 2 Dy = t, 

(2D + 3)x + (2D + 8 )y = 2. 

Now we eliminate y, obtain the differential equation 

(D 2 + 2D - 3)x = t + ± 
for x, and find its general solution 

x = c l e t + c 2 e" 3r -it-^, 

exactly as in Example 7.3. 

We now proceed using the alternative method. We first obtain from (7.18) a relation 
that involves the unknown y but not the derivative Dy. The system (7.18) of this 
example is so very simple that we do so by merely adding the equations (7. 1 8). Doing so, 
we at once obtain 


(7.18) 

(7.19) 

(7.20) 


4Dx + Sy = t + 2, 



7.1 DIFFERENTIAL OPERATORS AND AN OPERATOR METHOD 277 


which does indeed involve y but not the derivative Dy , as desired. From this, we at once 
find 


From (7.20), we find 


y = i(r + 2 - 4 Dx). 


Dx = c l e' — 3c 2 e 31 — 3 . 


Substituting into (7.26), we get 

y = i(t + 2 - 40^' + 12 c 2 e -3 ' + f) 


= —jCie' + jc 2 e 3, + |t + ir- 


Thus the general solution of the system may be written 


x = c l e' + c 2 e 3 , -it- 3 b 
y = -ic t e‘ + jc 2 e~ 3 ' + it + -ft, 


where c t and c 2 are arbitrary constants. 


Exercises 


Use the operator method described in this section to find the general solution of each of 
the following linear systems. 

dx dy , , dx dy 

1. — + — 2x - 4y = e', 2. — + - — x = - 2 f, 

dt dt dtdt 

dx dy dx d y , 

-T + -T- y = e M . — + - 3x - y = t 2 . 

dt dt dtdt 

. dx dy . , . dx dy . „ , 

3. — + — — x — 3y = e , 4. — + — — x — 2y = 2e , 

dt dt dt dt 

dx dy ,, dx dy . . 2 , 

dt dt dt dt 


, . dx dy 

5 ■ 2 jt + i,- x - y = e ■ 
d ± + d -i + lx + y = e .. 

dt dt 

_ dx dy - ,, 

dt dt 
. dx dy 

9. — + -j- + 2y = sin t, 
dt dt 

dx dy 

— + -f-x-y = 0. 
dt dt 


, „ dx dy 

6 - 2 -jt + j,- 2x - y = t ’ 

± + d i- 4*-,-.'. 
dt dt 

„ dx dy „ 

8 - dr y T,- x - 3y ~ x 

~ + 2~ - 2x — 3y = 1 . 
dt dt 

, . dx dy 

,0 ' ¥-s- 2i + 4,,= '’ 

dx dy 

— + -y-~ x- y = l. 
dt dt 
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<< ~dx dy 

11. 2 — — h — — b x + 5y = 4t, 
dt dt 


dx dy _ 2 

rfF + ^~ x + 5y = ( ’ 


— — I — - — h 2x + 2y = 2. 
A dt 


dx -dy 

— — h 2 — — 2x -b Ay = 2t -b 1. 
dt dt 


.. -dx dy , 

13 - 2 ^ + ^ + x + >, = J +4t> 


. , _ dx _ dy 

14 3 dF + 2 ^~ x + = 


+ y" + 2x + 2y = 2t 2 — 2 1. 
dt dt 


dx dy 

— + -^-x = t + 2. 

dt dt 


. , . dx dy 

15. 2 — + 4-r- + x — y = 3e , 
dt dt 


16. 2^ + T 


x — y = —2t, 


dx dy . 

— + -f - + 2x + 2y = e‘. 
dt dt 

«_ ~dx dy 

17 ' 2 * + 57-'- , ’ =1 ' 


dx dy 0 

— + -f- + x - y = t 2 . 
dt dt 


d^ + d _L = e « 

dt 2 dt 


dx dy 

— + -f + 2x - y = t. 
dt dt 


d 2 x dy 

19 - dF + di~ x+y = l 


dx dy 

-r + -f-x- y = 0. 
dt dt 

d 2 x dy t 
dt 2 dt 


d 2 y dx 
dt 2 + dt 


j+~i7-x+y = 0. 


dx dy 

— + - — 4 x — y = 2e‘. 
dt dt 


.. d 2 x dy 
21 . —y ~-f = t + l, 
dt 2 dt 


d 2 x A dy , 

+ 4 ^ + x _ 4> , = 0 , 


dx dy - 

— + -r-3x+ y = 2 t- 1 . 
dt dt 


dx dy „ 

— + - — x + 9y = e 2 '. 
dt dt 7 


In each of Exercises 23-26, transform the single linear differential equation of the form 
(7.6) into a system of first-order differential equations of the form (7.9). 


_ d 2 x dx , 

23. — -=- — 3 — + 2x = t 2 . 

dt 2 dt 


d 3 x ^d 2 x dx _ 

24 ' 7F* 2 l?~H~ 2x “ ■ 


d 3 x d 2 x A ,dx 
25. —3- + t-r-y + 2t 3 — 
dt 3 dt 2 dt 


5t 4 = 0. 


t 2 — =r + 2 tx = cos t. 
dt 2 


7.2 APPLICATIONS 

A. Applications to Mechanics 

Systems of linear differential equations originate in the mathematical formulation of 
numerous problems in mechanics. We consider one such problem in the following 
example. Another mechanics problem leading to a linear system in given in the 
exercises at the end of this section. 
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► Example 7.5 

On a smooth horizontal plane BC (for example, a smooth table top) an object A l is 
connected to a fixed point P by a massless spring Si of natural length L 1 . An object A 2 
is then connected to A x by a massless spring S 2 of natural length L 2 in such a way that 
the fixed point P and the centers of gravity A t and A 2 all lie in a straight line (see 
Figure 7.1). 


B 


! si 


s 2 ggii 


T 





I 

0i 


L2 


Figure 7.1 


The object A t is then displaced a distance a x to the right or left of its equilibrium 
position O l9 the object A 2 is displaced a distance a 2 to the right or left of its equilibrium 
position 0 2 , and at time t = 0 the two objects are released (see Figure 7.2). What are the 
positions of the two objects at any time t > 0? 



0 1 02 


Figure 7.2 


Formulation. We assume first that the plane BC is so smooth that frictional forces 
may be neglected. We also assume that no external forces act upon the system. Suppose 
object A x has mass mi and object A 2 has mass m 2 . Further suppose spring Si has spring 
constant k x and spring S 2 has spring constant k 2 . 

Let Xi denote the displacement of A i from its equilibrium position Oj at time t > 0 
and assume that Xj is positive when A i is to the right of 0 t . In like manner, let x 2 
denote the displacement of A 2 from its equilibrium position 0 2 at time t > 0 and 
assume that x 2 is positive when A 2 is to the right of 0 2 (see Figure 7.3). 



Figure 7.3 
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Consider the forces acting on A l at time t > 0. There are two such forces, F t and F 2 , 
where F t is exerted by spring S t and F 2 is exerted by spring S 2 . By Hooke’s law 
(Section 5.1) the force F t is of magnitude k l \x i |. Since this force is exerted toward the 
left when >l i is to the right of and toward the right when A t is to the left of O l9 we 
have F t = —k i x l . Again using Hooke’s law, the force F 2 is of magnitude fc 2 s, where s is 
the elongation of S 2 at time t. Since s = |x 2 — x t |, we see that the magnitude of F 2 is 
k 2 \x 2 — x t |. Further, since this force is exerted toward the left when x 2 — x t < 0 and 
toward the right when x 2 — x t > 0, we see that F 2 = k 2 (x 2 — xj. 

Now applying Newton’s second law (Section 3.2) to the object A l9 we obtain the 
differential equation 




d 2 x i 


-k l x l + k 2 (x 2 - x,). 


(7.27) 


We now turn to the object A 2 and consider the forces that act upon it at time t > 0. 
There is one such force, F 3 , and this is exerted by spring S 2 . Applying Hooke’s law once 
again, we observe that this force is also of magnitude k 2 s = /c 2 |x 2 — xj. Since F 3 is 
exerted toward the left when x 2 — x Y >0 and toward the right when x 2 — x i < 0, we 
see that F 3 = — k(x 2 - x x ). Applying Newton’s second law to the object A l9 we obtain 
the differential equation 

m i d ^=-k 2 (x 2 -x,). (7.28) 


In addition to the differential equations (7.27) and (7.28), we see from the statement 
of the problem that the initial conditions are given by 


x 1 (0) = « 1 , x;(0) = 0, x 2 (0 ) = fl2 , x 2 (0) = 0. (7.29) 


The mathematical formulation of the problem thus consists of the differential 
equations (7.27) and (7.28) and the initial conditions (7.29). Writing the differential 
equations in the form 


m l 


d 2 x l 
dt 2 


+ (k t + k 2 )x i — k 2 x 2 = 0, 


m 2 


d 2 x 2 

~dF 


— k 2 x Y + k 2 x 2 = 0, 


(7.30) 


we see that they form a system of homogeneous linear differential equations with 
constant coefficients. 


Solution of a Specific Case. Rather than solve the general problem consisting of 
the system (7.30) and conditions (7.29), we shall carry through the solution in a 
particular case that was chosen to facilitate the work. Suppose the two objects A l and 
^i 2 are each of unit mass, so that m l = m 2 = 1. Further, suppose that the springs and 
S 2 have spring constants = 3 and k 2 = 2, respectively. Also, we shall take a i = - 1 
and a 2 = 2. Then the system (7.30) reduces to 


d 2 x x 

~dF 


+ 5x! 


— 2x 2 = 0, 


d 2 x 2 

~dF 


2x t + 2x 2 = 0, 


(7.31) 
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and the initial conditions (7.29) become 

x^O) = - 1, x;(0) = 0, x 2 (0) = 2, x' 2 (0) = 0. (7.32) 

Writing the system (7.31) in operator notation, we have 

( D 2 + 5)x t — 2 x 2 = 0, 

— 2x x + (D 2 + 2)x 2 = 0. (7-33) 

We apply the operator (D 2 + 2) to the first equation of (7.33), multiply the second 
equation of (7.33) by 2, and add the two equations to obtain 

[(D 2 + 2)(D 2 + 5) - 4]xj = 0 


or 


(D 4 + 7 D 2 + 6)x t = 0. (7.34) 

The auxiliary equation corresponding to the fourth-order differential equation (7.34) is 

m 4 -f 1m 1 + 6 = 0 or (m 2 + 6)(m 2 + 1) = 0. 

Thus the general solution of the differential equation (7.34) is 

x x = c t sin t + c 2 cos t + c 3 sin yj~6t + c 4 cos y/6t. (7.35) 

We now multiply the first equation of (7.33) by 2, apply the operator (D 2 + 5) to the 
second equation of (7.33), and add to obtain the differential equation 


(D 4 + ID 2 + 6)x 2 = 0 

for x 2 . The general solution of (7.36) is clearly 

x 2 = sin t + k 2 cos t + k 3 sin y/6t + fc 4 cos y/6t. 
The determinant of the operator “coelhcients” in the system (7.33) is 

\D 2 + 5 -2 

-2 D 2 + 2 


(7.36) 

(7.37) 


= D 4 + ID 2 + 6. 


Since this is a fourth-order operator, the general solution of (7.31) must contain four 
independent constants. We must substitute given by (7.35) and x 2 given by (7.37) into 
the equations of the system (7.31) to determine the relations that must exist among the 
constants c i9 c 2 , c 3 , c 4 , k i9 k 2 , fc 3 , and fc 4 in order that the pair (7.35) and (7.37) 
represent the general solution of (7.31). Substituting, we find that 

k\ = 2 c 1? k 2 = 2c 2 , fc 3 = — 2 c 3 , fc 4 = —jc 4 . 

Thus the general solution of the system (7.31) is given by 

x t = c t sin t + c 2 cos t + c 3 sin y/6t + c 4 cos x /6t, 

x 2 = 2c t sin t + 2c 2 cos t — jc 3 sin x / / 6 1 — jc 4 cos x / / 6 1 . 


(7.38) 


We now apply the initial conditions (7.32). Applying the conditions x t = — 1, 
dx l /dt = 0 at t = 0 to the first of the pair (7.38), we find 

-1 = c 2 + c 4 , 

0 = c t + y/6c 3 . 


(7.39) 
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Applying the conditions x 2 = 2, dx 2 /dt = 0 at t = 0 to the second of the pair (7.38), we 
obtain 


2 — 2c 2 2^4> 


0 = 2 c i 



(7.40) 


From Equations (7.39) and (7.40), we find that ' 

Cl =0, c 2 = I, c 3 = 0, c 4 = -f . 

Thus the particular solution of the specific problem consisting of the system (7.31) and 
the conditions (7.32) is 

*i = fcos t — f cos y/6t, 
x 2 = f cos t + fcos y/6t. 


B. Applications to Electric Circuits 

In Section 5.6 we considered the application of differential equations to electric circuits 
consisting of a single closed path. A closed path in an electrical network is called a loop. 
We shall now consider electrical networks that consist of several loops. For example, 
consider the network shown in Figure 7.4. 



Figure 7.4 


This network consists of the three loops ABMNA , BJKMB , and ABJKMNA. 
Points such as B and M at which two or more circuits join are called junction points or 
branch points. The direction of current flow has been arbitrarily assigned and indicated 
by arrows. 

In order to solve problems involving multiple loop networks we shall need two 
fundamental laws of circuit theory. One of these is Kirchhoff’s voltage law, which we 
have already stated and applied in Section 5.6. The other basic law that we shall employ 
is the following: 

Kirchhoff's Current Law. In an electrical network the total current flowing into a 
junction point is equal to the total current flowing away from the junction point. 
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As an application of these laws we consider the following problem dealing with the 
circuit of Figure 7.4. 


► Example 7.6 

Determine the currents in the electrical network of Figure 7.4, if E is an electromotive 
force of 30 V, is a resistor of 10 Q, R 2 is a resistor of 20 Q, L i is an inductor of 

0. 02. H, L 2 is an inductor of 0.04 H, and the currents are initially zero. 

Formulation. The current flowing in the branch MNAB is denoted by i, that 
flowing on the branch BM by i t , and that flowing on the branch BJKM by i 2 . 

We now apply Kirchhoff’s voltage law (Section 5.6) to each of the three loops 
ABMNA , BJKMB , and ABJKMNA . 

For the loop ABMNA the voltage drops are as follows: 

1. Across the resistor R t : lOi. 

di i 

2. Across the inductor L,: 0.02 — . 

dt 

Thus applying the voltage law to the loop ABMNA , we have the equation 

0.02 ‘-y- + 10/ = 30. (7.41) 

dt 

For the loop BJKMB , the voltage drops are as follows: 

1. Across the resistor R 2 : 20 i 2 . 

di 2 

2. Across the inductor L 2 : 0.04 — . 

dt 

dii 

3. Across the inductor L t : — 0.02 — . 

dt 

The minus sign enters into 3 since we traverse the branch MB in the direction opposite 
to that of the current i l as we complete the loop BJKMB. Since the loop BJKMB 
contains no electromotive force, upon applying the voltage law to this loop we obtain 
the equation 

- 0.02 ~ + 0.04 ^ + 20i 2 = 0. (7.42) 

dt dt 

For the loop ABJKMNA, the voltage drops are as follows: 

1. Across the resistor R t : 10 i. 

2. Across the resistor R 2 : 20 i 2 . 

di 2 

3. Across the inductor L 2 : 0.04 . 

dt 
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Applying the voltage law to this loop, we obtain the equation 

10; + 0.04 ^ + 20; 2 = 30. 
dt 


(7.43) 


We observe that the three equations (7.41), (7.42), and (7.43) are not all independent. 
For example, we note that (7.42) may be obtained by subtracting (7.41) from (7.43). 
Thus we need to retain only the two equations (7.41) and (7.43). 

We now apply Kirchhoff’s current law to the junction point B. From this we see at 
once that 

i = ii + i 2 - (7.44) 

In accordance with this we replace i by i x -f i 2 in (7.41) and (7.43) and thus obtain the 
linear system 


0.02 + 10;, + 10;' 2 = 30, 

at 

10;. + 0.04 ^ + 30; 2 = 30. 
at 


(7.45) 


Since the currents are initially zero, we have the initial conditions 

;',(0) = 0 and ; 2 (0) = 0. 

Solution. We introduce operator notation and write the system (7.45) in the form 

(0.02D + 10)1! + 10f 2 = 30, 

10; i + (0.04D + 30) ; 2 = 30. 


(7.46) 


(7.47) 


We apply the operator (0.04Z) + 30) to the first equation of (7.47), multiply the 
second by 10, and subtract to obtain 

[(0.040 + 30)(0.02D + 10) - 100];, = (0.04D + 30)30 - 300 
(0.0008D 2 + D + 200);, = 600 


or 


or, finally, 

(£> 2 + 1250 D + 250,000);, = 750,000. (7.48) 

We now solve the differential equation (7.48) for ;, . The auxiliary equation is 

m 2 + 1250m + 250,000 = 0 
or 

(m + 250)(m + 1000) = 0. 

Thus the complementary function of Equation (7.48) is 

;,. c = c l e~ 250 ' + c 2 e~ 1000 ‘, 

and a particular integral is obviously ;, p = 3. Hence the general solution of the 
differential equation (7.48) is 

_ . „-250( _j_ „ „-1000t 


;, = c,e 


+ c 2 e‘ 


+ 3. 


(7.49) 
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Now returning to the system (7.47), we multiply the first equation of the system by 
10, apply the operator (0.02 -f 10) to the second equation, and subtract the first from 
the second. After simplifications we obtain the differential equation 

(D 2 + 1250D + 2 50,000) i 2 = 0 

for i 2 . The general solution of this differential equation is clearly 

i 2 = k ie - 250t + k 2 e- l000t . (7.50) 

Since the determinant of the operator “coefficients” in the system (7.47) is a second- 
order operator, the general solution of the system (7.45) must contain two independent 
constants. We must substitute i t given by (7.49) and i 2 given by (7.50) into the equations 
of the system (7.45) to determine the relations that must exist among the constants 
Ci, c 2 , fci, k 2 in order that the pair (7.49) and (7.50) represent the general solution of 
(7.45). Substituting, we find that 

ki = ~ic u k 2 = c 2 . (7.51) 

Thus the general solution of the system (7.45) is given by 

i, = c 1 e- 250 ' + c 2 e- 1000 ' + 3, 

» 2 = -ic ie - 250 ‘ + c 2 e- 1000 ‘. (7 ' 52) 

Now applying the initial conditions (7.46), we find that c t + c 2 + 3 = Oand -jc i + 
c 2 = 0 and hence c t = -2 and c 2 = — 1. Thus the solution of the linear system (7.45) 
that satisfies the conditions (7.46) is 

1 1 = —2e~ 250t — e~ i000t + 3, 

1 2 = e ~ 25 ot_ e -iooot 

Finally, using (7.44) we find that 

i = - e ~ 250t - 2e~ l000t + 3. 

We observe that the current i 2 rapidly approaches zero. On the other hand, the 
currents, i t and i = -f i 2 rapidly approach the value 3. 


C. Applications to Mixture Problems 

In Section 3.3C, we considered mixture involving the amount of a substance S in a 
mixture in a single container, into and from which there flowed mixtures containing S. 
Here we extend the problem to situations involving two containers. That is, we consider 
a substance S in mixtures in two interconnected containers, into and from which there 
flow mixtures containing S. The mixture in each container is kept uniform by stirring; t 
and we seek to determine the amount of substance S present in each container at time t. 

Let x denote the amount of S present in tank X at time t , and let y denote the amount 
of S present in tank Y at time t. We then apply the basic equation (3.55) of Section 3.3C 
in the case of each of the unknowns x and y. Now, however, the “IN” and “OUT” terms 
in each of the resulting equations also depends on just how the two containers are 
interconnected. We illustrate with the following example. 
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► Example 7.7 

Two tanks X and Y are interconnected (see Figure 7.5). Tank X initially contains 100 
liters of brine in which there is dissolved 5 kg of salt, and tank Y initially contains 
100 liters of brine in which there is dissolved 2 kg of salt. Starting at time r = 0,(1) pure 
water flows into tank X at the rate of 6 liters/min, (2) brine flows from tank X into tank 
Y at the rate of 8 liters/min, (3) brine is pumped from tank Y back into tank X at the 
rate of 2 liters/min, and (4) brine flows out of tank Y and away from the system at the 
rate of 6 liters/min. The mixture in each tank is kept uniform by stirring. How much 
salt is in each tank at any time t > 0? 


2 liters brine/min 


it 


8 liters brine/min 


Brine 


6 liters brine/min 


Brine 


Tank X 


Tank Y 


Figure 7.5 


Formulation. Let x = the amount of salt in tank X at time t , and let y = the 
amount of salt in tank Y at time t, each measured in kilograms. Each of these tanks 
initially contains 100 liters of fluid, and fluid flows both in and out of each tank at the 
same rate, 8 liters/min, so each tank always contains 100 liters of fluid. Thus the 
concentration of salt at time t in tank X is x/100 (kg/liter) and that in tank Y is 
y/100 (kg/liters). 

The only salt entering tank X is in the brine that is pumped from tank Y back into 
tank X. Since this enters at the rate of 2 liters/min and contains y/100 kg/liter, the rate 
at which salt enters tank X is 2y/100. Similarly, the only salt leaving tank X is in the 
brine that flows from tank X into tank Y. Since this leaves at the rate of 8 liters/min and 
contains x/100 kg/liter, the rate at which salt leaves tank X is 8x/100. Thus we obtain 
the differential equation 


dx 2 y 8x 


(7.53) 


for the amount of salt in tank X at time t. In a similar way, we obtain the differential 
equation 


dy 8x 8y 
dt = m~m 


(7.54) 


for the amount of salt in tank Y at time t. Since initially there was 5 kg of salt in tank X 
and 2 kg in tank Y, we have the initial conditions 

x(0) = 5, y(0) = 2 (7.55) 

Thus we have the linear system consisting of differential equations (7.53) and (7.54) and 
initial conditions (7.55). 
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Solution. We introduce operator notation and write the differential equations 
(7.53) and (7.54) in the forms 


0 + tIo) x - 

( d + 4 ) j=0 ' 


(7.56) 


“loo * 4 


We apply the operator (£> + jfg) to the first equation of (7.56), multiply the second 
equation by y^o, and add to obtain 


D + 


8 

100 


D + 


8 


16 


iooy (ioo) 


r x = 0, 


which quickly reduces to 


1“ , 16 48 "I 

L + Ioo D + (wJ 


(7.57) 


We now solve the homogeneous differential equation (7.57) for x. The auxiliary 
equation is 

- 16 48 

m + Ioo m + (W“ 0, 


or 


m + 


T5o)(" ,+ 


12 

100 


= 0, 


with real distinct roots (— l)/25 and ( — 3)/25. Thus the general solution of equation 
(7.57) is 


x = Cl <r< 1/25) ‘ + c 2 e- (3/25)t . 


(7.58) 


Now applying the so-called alternative procedure of Section 7. 1C, we obtain from 
system (7.56) a relation that involves the unknown y but not the derivative Dy. The 
system (7.56) is so especially simple that the first equation of this system is itself such a 
relation. Solving this for j/, we at once obtain 


From (7.58), we find 


y = 50Dx + 4x. 


Dx — _fl c -(l/25)r_^£2 c -( 3 /25),_ 


(7.59) 


25 


25 


Substituting into (7.59), we get 

y = 2c 1 e~ <1/25) ‘ — 2c 2 c _(3/25)t . 

Thus the general solution of the system (7.56) is 

x = c 1 e -<1/25) ' + c 2 e~ (3,25)t , 
y = 2c 1 e -<1/25,( — 2c 2 e _<3/25,( . 


(7.60) 
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We now apply the initial conditions (7.55). We at once obtain 

Ci +c 2 = 5, 

2c i - lc 2 = 2, 

from which we find 

Ci =3, c 2 — 2. 

Thus the solution of the linear system (7.56) that satisfies the initial conditions (7.55) is 

x = 3c -(1/25)f + 2e~ (3/25)t , 
y = 6e~ il,25)t — 4e~ (3/25)t . 

These expressions give the amount of salt x in tank X and the amount y in tank Y, 
respectively, each measured in kilograms, at any time t (min) > 0. Thus, for example, 
after 25 min, we find 

x = 3e~ l + 2e~ 3 % 1.203 (kg), 
y = 6e~ l — 4e~ 3 % 2.008 (kg). 

Note that as t oo, both x and y -+ 0. Thus is in accordance with the fact that no salt at 
all (but only pure water) flows into the system from outside. 


Exercises 


1. Solve the problem of Example 7.5 for the case in which the object A x has mass 
m x = 2, the object A 2 has mass m 2 = 1, the spring Si has spring constant k i = 4, 
the spring S 2 has spring constant k 2 = 2, and the initial conditions are x t (0) = 1, 
Xi(0) = 0, x 2 (0) = 5, and x' 2 (0) = 0. 

2. A projectile of mass m is fired into the air from a gun that is inclined at an angle 6 
with the horizontal, and suppose the initial velocity of the projectile is v 0 feet per 
second. Neglect all forces except that of gravity and the air resistance, and assume 
that this latter force (in pounds) is numerically equal to k times the velocity 
(in ft/sec). 

(a) Taking the origin at the position of the gun, with the x axis horizontal and the 
y axis vertical, show that the differential equations of the resulting motion are 


d 2 x 


dx 


m 


dt 


2 ^ 

2 dt 


d 2 y , dy 

m —y + k — + mg = 0. 
dt 2 dt 


(b) Find the solution of the system of differential equations of part (a). 

3. Determine the currents in the electrical network of Figure 7.6 if E is an 
electromotive force of 100 V, R x is a resistor of 20 Q, R 2 is a resistor of 40 Q, L x is 
an inductor of 0.01 H, L 2 is an inductor of 0.02 H, and the currents are initially 
zero. 
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Li L2 



4. Set up differential equations for the currents in each of the electrical networks 

shown in Figure 7.7. Do not solve the equations. 

(a) For the network in Figure 1.1a assume that E is an electromotive force of 
15 V, R is a resistor of 20 Q, L is an inductor of 0.02 H, and C is a capacitor of 
10“ 4 farads. 

(b) For the network in Figure 1.1b assume that E is an electromotive force of 
100 sin 130f V, is a resistor of 20 Q, R 2 is a resistor of 30 Q, and L is an 
inductor of 0.05 H. 

(c) For the network of Figure 7.7c assume that E is an electromotive force of 
100 V, JR ! is a resistor of 20 Q, R 2 is a resistor of 10 Q, Q is a capacitor of 10“ 4 
farads, and C 2 is a capacitor of 2 x 10“ 4 farads. 


R Ri 



Ri R% 



5. Two tanks are interconnected. Tank X initially contains 90 liters of brine in which 
there is dissolved 3 kg of salt, and tank Y initially contains 90 liters of brine in 
which there is dissolved 2 kg of salt. Starting at time t = 0, (1) pure water flows into 
tank X at the rate of 4.5 liters/min, (2) brine flows from tank X into tank Y at the 
rate of 6 liters/min, (3) brine is pumped from tank Y back into tank X at the rate of 
1.5 liters/min, and (4) brine flows out of tank Y and away from the system at the 
rate of 4.5 liters/min. The mixture in each tank is kept uniform by stirring. How 
much salt is in each tank at any time t > 0? 

6. Two tanks X and Y are interconnected. Tank X initially contains 30 liters of brine 
in which there is dissolved 30 kg of salt, and tank Y initially contains 30 liters of 
pure water. Starting at time t = 0, (1) brine containing 1 kg of salt per liter flows 
into tank X at the rate of 2 liters/min and pure water also flows into tank X at the 
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rate of 1 liter/min, (2) brine flows from tank X into tank Y at the rate of 
4 liters/min, (3) brine is pumped from tank Y back into tank X at the rate of 
1 liter/min, and (4) brine flows out of tank Y and away from the system at the rate 
of 3 liters/min. The mixture in each tank is kept uniform by stirring. How much 
salt is in each tank at any time t > 0? 


7.3 BASIC THEORY OF LINEAR SYSTEMS IN NORMAL FORM: 
TWO EQUATIONS IN TWO UNKNOWN FUNCTIONS 

A. Introduction 


We shall begin by considering a basic type of system of two linear differential equations 
in two unknown functions. This system is of the form 


— = a n (t)x + a 12 (t)>' + F 1 (t), 

dy 

— = a 2l (t)x + a 22 (t)y + F 2 (t). 


(7.61) 


We shall assume that the functions a n , a 12 , F u a 21 , a 22 ,andF 2 areall continuous on a 
real interval a < t < b. If F i (t) and F 2 (t) are zero for all t, then the system (7.61) is called 
homogeneous ; otherwise, the system is said to be nonhomogeneous. 


► Example 7.8 


The system 


is homogeneous; the system 


dx 

-ji =2x ~ y - 


dy 

dt 


= 3x + 6 y. 


dx 

It 


= 2x — y — 5t, 


dy 

i = 3 * + 6 ,- 4 , 


is nonhomogeneous. 


(7.62) 


(7.63) 


DEFINITION 

By a solution of the system (7.61) we shall mean an ordered pair of real functions 

(fg), (7.64) 
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each having a continuous derivative on the real interval a < t < b, such that 

df(t) 


dt 

dg(t) 

dt 


= flnW/W + a l2 (t)g(t) + 

= a 2 i (0/(0 + a 22 (t)g(t) + F 2 (t), 


for all t such that a < t < b. In other words , 

^ = fit), 
y = git), 

simultaneously satisfy both equations of the system (7.61) identically for a < t < b. 


(7.65) 


(7.65) 


Notation. We shall use the notation 

x = fit), 
y = git), 

to denote a solution of the system (7.61) and shall speak of “the solution 

* = /(*)> 

y = ” 

Whenever we do this, we must remember that the solution thus referred to is really the 
ordered pair of functions (f g) such that (7.65) simultaneously satisfy both equations of 
the system (7.61) identically on a < t < b. 


► Example 7.9 

The ordered pair of functions defined for all t by (e 5 \ — 3e 5t ), which we denote by 


x = e 5 \ 
y = —3e 5t , 

is a solution of the system (7.62). That is, 

x = e 5 ', 
y = -3e 5t , 


(7.66) 


(7.66) 


simultaneously satisfy both equations of the system (7.62). Let us verify this by directly 
substituting (7.66) into (7.62). We have 


dt 


(e 5 ‘) = 2(e 5t ) — ( — 3e 5 '), 


or 


-(-3e 5t ) = 3(e 5 ') + 6(-3e 5 ‘), 


5e St = 2e 5 ’ + 3c 5 ', 
-15c 5 ' = 3c 5 '- 18c 5 '. 
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Hence (7.66) is indeed a solution of the system (7.62). The reader should verify that the 
ordered pair of functions defined for all t by (e 3 \ - e 3t ), which we denote by 



is also a solution of the system (7.62). 


We shall now proceed to survey the basic theory of linear systems. We shall observe a 
close analogy between this theory and that introduced in Section 4.1 for the single 
linear equation of higher order. Theorem 7. 1 is the basic existence theorem dealing with 
the system (7.61). 


THEOREM 7.1 

Hypothesis. Let the functions a n , a 12 , F t , a 2U a 22 ,andF 2 in the sy stem ( 7 . 61) all be 
continuous on the interval a < t < b. Let t 0 be any point of the interval a < t < b; and let 
c t and c 2 be two arbitrary constants. 

Conclusion. There exists a unique solution 

X = fit), 

y = 9{t), 

of the system (7.61) such that 

f(t 0 ) = c l and g(t 0 ) = c 2 , 
and this solution is defined on the entire interval a < t < b. 


► Example 7.10 

Let us consider the system (7.63). The continuity requirements of the hypothesis of 
Theorem 7.1 are satisfied on every closed interval a < t <b. Hence, given any point t 0 
and any two constants c t and c 2 , there exists a unique solution x — f (f), y = g(t) of the 
system (7.63) that satisfies the conditions f(t 0 ) = c u g(t 0 ) = c 2 . For example, there 
exists one and only one solution x = /(f), y = g(t) such that /( 2) = 5, g(2) = — 7. 


B. Homogeneous Linear Systems 

We shall now assume that F l (t) and F 2 (t) in the system (7.61) are both zero for all t and 
consider the basic theory of the resulting homogeneous linear system 

~- = a ll (t)x + a i2 (t)y. 


— = a 2i (t)x + a 22 (t)y. 


(7.67) 
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We shall see that this theory is analogous to that of the single nth-order homogeneous 
linear differential equation presented in Section 4. IB. Our first result concerning the 
system (7.67) is the following. 


THEOREM 7.2 
Hypothesis. Let 


* = fM 


y = 0i ( 0 , 


and 


x = f 2 (t ), 


y = 02 W, 


(7.68) 


be Wo solutions of the homogeneous linear system (7.67). Let c l and c 2 be two arbitrary 
constants. 


Conclusion. Then 

X = Cifi(t) + c 2 f 2 (t\ 
y = Ci0iW + c 2 g 2 (t\ 
is also a solution of the system (7.67). 


(7.69) 


DEFINITION 

The solution (7.69) is called a linear combination of the solutions (7.68). This definition 
enables us to express Theorem 7.2 in the following alternative form. 


THEOREM 7.2 RESTATED 

Any linear combination of two solutions of the homogeneous linear system (7.67) is itself a 
solution of the system (7.67). 


► Example 7.1 1 


We have already observed that 


x = e 5 \ 


x = e 3 \ 


and 


y = —7>e 5 \ 


<r i 

QU 

1 

II 


are solutions of the homogeneous linear system (7.62). Theorem 7.2 tells us that 

x = c t e 5t -f c 2 e 3 \ 
y = —3 Cj e 5t — c 2 e 3 \ 

where c x and c 2 are arbitrary constants, is also a solution of the system (7.62). For 
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example, if c l = 4 and c 2 — -2, we have the solution 

x = 4e 5t — 2e 3 \ 
y= -12e 5t + 2e 3t . 


DEFINITION 

Let 


X = fl(t), X = f 2 (t), 

and 

y = gM y = g 2 (t)> 

be two solutions of the homogeneous linear system (7.67). These two solutions are linearly 
dependent on the interval a < t <b if there exist constants c t and c 2 , not both zero, such 
that 


+ c 2 f 2 (t) — 0, 
c i0i(O + c 2 g 2 (t) = 0, 

for all t such that a < t < b. 


(7.70) 


DEFINITION 


Let 


x = Mt), 


y = 9iit\ 


and 


x = f 2 (t\ 


y = Giit\ 


be two solutions of the homogeneous linear system (7.67). These two solutions are linearly 
independent on a < t < b if they are not linearly dependent on a < t < b. That is, 
the solutions x = f t (t\ y = g t (t) and x = f 2 (t ), y = g 2 (t) are linearly independent on 
a < t <b if 


c ifi(t) + c 2fi(t) — 
CiQiit) + c 2 g 2 (t) = 0, 
for all t such that a < t < b implies that 

Ci = c 2 = 0. 


(7.71) 


► Example 7.1 2 

The solutions 


x = e 5 \ 

x = 2e 5 \ 

and 


•n 

m 

1 

II 

VO 

1 

II 


of the system (7.62) are linearly dependent on every interval a<t <b. For in this case 
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the conditions (7.70) become 

c t e 5t + 2 c 2 e 5t = 0, 

-3c t e 5, -6c 2 e s, = 0, ^' 72) 

and clearly there exist constants c t and c 2 , not both zero, such that the conditions (7.72) 
hold on a < t < b. For example, let c i = 2 and c 2 = — 1. 

On the other hand, the solutions 

x = e 5 ', x = e 3 ', 

and 

y = — 3e 5f , y = — e 3 \ 

of system (7.62) are linearly independent on a < t < b. For in this case the conditions 
(7.71) are 

c t e 5t + c 2 e 3t = 0, 

— 3 c t e 5t — c 2 e 3t = 0. 

If these conditions hold for all t such that a < t < b, then we must have c Y — c 2 — 0. 

We now state the following basic theorem concerning sets of linearly independent 
solutions of the homogeneous linear system (7.67). 


THEOREM 7.3 

There exist sets of two linearly independent solutions of the homogeneous linear system 
(7.67). Every solution of the system (7.67) can be written as a linear combination of any 
two linearly independent solutions of (7.67). 

► Example 7.1 3 
We have seen that 


x = e 3 \ 


x = e 3 ', 


and 


y=-3e 5t . 


y — — e 3t . 


constitute a pair of linearly independent solutions of the system (7.62). This illustrates 
the first part of Theorem 7.3. The second part of the theorem tells us that every solution 
of the system (7.62) can be written in the form 

x = c t e 5t + c 2 e 3 \ 

y = — 3c^ 5f — c 2 e 3 \ 

where c x and c 2 are suitably chosen constants. 

Recall that in Section 4.1 in connection with the single nth-order homogeneous 
linear differential equation, we defined the general solution of such an equation to be a 
linear combination of n linearly independent solutions. As a result of Theorems 7.2 and 
7.3 we now give an analogous definition of general solution for the homogeneous linear 
system (7.67). 
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DEFINITION 

Let 

x = x = f 2 (t\ 

and 

y = 0i (t), y = 0 2 (O, 

be two linearly independent solutions of the homogeneous linear system (7.67). Let c t and 
c 2 be two arbitrary constants. Then the solution 

x = cj t (t) + c 2 f 2 (t\ 

y — c i0i(O + c 2 g 2 (t), 

is called a general solution of the system (7.67). 


► Example 7.14 

Since 


x = e 5 ', 


x = e 3 ‘. 


and 


y = —3e 5 ', 


y = — e 3 ', 


are linearly independent solutions of the system (7.62), we may write the general 
solution of (7.62) in the form 

x = c t e 5t + c 2 e 3 \ 
y = — 3 c t e 5t — c 2 e 3 \ 
where c l and c 2 are arbitrary constants. 

In order to state a useful criterion for the linear independence of two solutions of 
system (7.67), we introduce the Wronskian of two solutions. Note that this is similar to, 
but different from, the now familiar Wronskian of two solutions of a single second- 
order linear equation, introduced in Section 4.1. 


DEFINITION 

Let 


X = flit), 

and 

y = 9i(t), 


X = f 2 (t). 


y = 9i it). 


be two solutions of the homogeneous linear system (7.67). The determinant 

hit) hit) 

9i(t) 9 lit) 


(7.73) 


is called the Wronskian of these two solutions. We denote it by W{ 1 ). 

We may now state the following useful criterion for the linear independence of two 
solutions of system (7.67). 
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THEOREM 7.4 


Two solutions 


x = 

and 

y = gM 


x = f 2 (t). 


y = Qiit), 


of the homogeneous linear system (7.67) are linearly independent on an interval a < t <b 
if and only if their Wronskian determinant 


W(t) = 


fi(t) f 2 (t) 

0 i (t) g 2 (t) 


is different from zero for all t such that a < t < b. 

Concerning the values of W(t ), we also state the following result. 


(7.73) 


THEOREM 7.5 

Let W(t) be the Wronskian of two solutions of homogeneous linear system (7.67) on an 
interval a < t < b. Then either W(t) = 0 for all t e [a, b] or W(t) = 0 for no t e [ a , b~\. 


► Example 7.15 


Let us employ Theorem 7.4 to verify the linear independence of the solutions 

x = e 5 \ x = e 3 \ 

and 

y = —3e 5 \ y = —e 3 \ 


of the system (7.62). We have 

W(t) = 


, 5 1 


,3r 


e e 
— 3e 5t —e 3t 


= 2e St # 0 


on every closed interval a < t < b. Thus by Theorem 7.4 the two solutions are indeed 
linearly independent on a < t < b. 


C. Nonhomogeneous Linear Systems 

Let us now return briefly to the nonhomogeneous system (7.61). A theorem and a 
definition, illustrated by a simple example, will suffice for our purposes here. 

THEOREM 7.6 
Hypothesis. Let 

X = fo(t), 

y = go(t). 
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be any solution of the nonhomogeneous system (7.61), and let 

x = f(t) 9 
y = g(t) 9 

be any solution of the corresponding homogeneous system (7.67). 

Conclusion. Then 

x = f(t) + fo(t\ 
y = g(t) + g 0 (t\ 

is also a solution of the nonhomogeneous system (7.61). 


DEFINITION 

Let 

x = fo(t\ 

y = 0o(O, 

be any solution of the nonhomogeneous system (7.61), and let 

x = /i(0> x = f 2 (t), 

and 

y = ffi(0 y = g 2 (t\ 

be two linearly independent solutions of the corresponding homogeneous system (7.67). 
Let Cj and c 2 be two arbitrary constants. Then the solution 

x = c i/i(0 + ^ 2 / 2(0 + /o(0» 
y = Cigi(t) + c 2 g 2 (t) + g 0 (t), 

will be called a general solution of the nonhomogeneous system (7.61). 

► Example 7.16 

The student may verify that 

x — 2 1 + 1, 

y= -u 

is a solution of the nonhomogeneous system (7.63). The corresponding homogeneous 
system is the system (7.62), and we have already seen that 

x = e 5t , x = e 3t , 

and 

y = —3e 5t , y = —e 3t , 

are linearly independent solutions of this homogeneous system. Theorem 7.6 tells us, 
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for example, that 

x = e 5t + 2t + 1, 
y = —3e 5t — t, 

is a solution of the nonhomogeneous system (7.63). From the preceding definition we 
see that the general solution of (7.63) may be written in the form 

x = c x e 5t + c 2 e 3t + 2t + 1, 

y = —3 c { e 5t — c 2 e 3t — t, 

where c x and c 2 are arbitrary constants. 


Exercises 

1. Consider the linear system 


dx 

— = 3x + 4 y, 
dt 


dy 

dt 


= 2 x + y. 


x = 2e 5t , 


X = 


and 


y = e 5t , 


y = 


(a) Show that 


are solutions of this system. 

(b) Show that the two solutions of part (a) are linearly independent on every 
interval a < t <b, and write the general solution of the system. 

(c) Find the solution 

X = f(t), 

y = g(t), 

of the system which is such that /( 0) = 1 and g(0) = 2. Why is this solution 
unique? Over what interval is it defined? 


2. Consider the linear system 


dx 

— = 5x + 3 y, 
dt 

dy 


(a) Show that 


x = 3e 7 


and 


y = 2e 1 \ 

are solutions of this system. 


y= -2e~\ 
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(b) Show that the two solutions of part (a) are linearly independent on every 
interval a < t < b, and write the general solution of the system. 

(c) Find the solution 

X = /(f), 

y = g(t\ 

of the system which is such that /( 0) = 0 and g(0) = 8. 

3. (a) Show that 

x = 2e 2 \ x = e nt , 

and 

y = —3e 2 \ y = e 7 \ 

are solutions of the homogeneous linear system 

dx 

* =5x+2 * 


3x + 4,. 
dt 

(b) Show that the two solutions defined in part (a) are linearly independent on 
every interval a < t < b, and write the general solution of the homogeneous, 
system of part (a). 

(c) Show that 

x = t + 1, 
y = — 5r — 2, 

is a particular solution of the nonhomogeneous linear system 

dx 

— — 5x + 2 y 5 

at 

dv 

— = 3x + 4 y + lit, 
at 

and write the general solution of this system. 

4. Let 

X = /l(f)> X = f 2 (t), 

and 

y = g i(f), y = g lit), 

be two linearly independent solutions of the homogeneous linear system (7.67), and 
let W(t) be their Wronskian determinant (defined by (7.73)). Show that W satisfies 
the first-order differential equation 


dW 

IT 


= Kl(f) + 022(f)] W 


5. Prove Theorem 7.2. 

6. Prove Theorem 7.6. 
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7.4 HOMOGENEOUS LINEAR SYSTEMS WITH CONSTANT COEFFICIENTS: 
TWO EQUATIONS IN TWO UNKNOWN FUNCTIONS 


A. Introduction 


In this section we shall be concerned with the homogeneous linear system 


dx 

dt 


= a { x 


+ b x y. 


dy 

dt 


= a 2 x + b 2 y, 


(7.74) 


where the coefficients a u b ly a 2 , and b 2 are real constants. We seek solutions of this 
system; but how shall we proceed? Recall that in Section 4.2 we sought and found 
exponential solutions of the single nth-order linear equation with constant coefficients. 
Remembering the analogy that exists between linear systems and single higher-order 
linear equations, we might now attempt to find exponential solutions of the system 
(7.74). Let us therefore attempt to determine a solution of the form 


x = Ae x \ 
y = Be x \ 


(7.75) 


where A, B , and X are constants. If we substitute (7.75) into (7.74), we obtain 

AXe Xt = a t Ae Xt + b { Be x \ 

BXe Xt = a 2 Ae Xt + b 2 Be Xt . 

These equations lead at once to the system 


(a { — X)A + b { B = 0, 

a 2 A + (b 2 - X)B = 0, (7 76) 

in the unknowns A and B. This system obviously has the trivial solution A = B = 0. 
But this would only lead to the trivial solution x = 0, y = 0 of the system (7.74). Thus 
we seek nontrivial solutions of the system (7.76). A necessary and sufficient condition 
(see Section 7.5c, Theorem A) that this system have a nontrivial solution is that the 
determinant 


a x — X 
a 2 



= 0 . 


Expanding this determinant we are led at once to the quadratic equation 


(7.77) 


X 2 - (a x + b 2 )X -f (a 1 b 2 - a 2 b { ) = 0 (7.78) 

in the unknown X. This equation is called the characteristic equation associated with the 
system (7.74). Its roots X { and X 2 are called the characteristic roots. If the pair (7.75) is 
to be a solution of the system (7.74), then / in (7.75) must be one of these roots. Suppose 
X = X l . Then substituting / = X 1 into the algebraic system (7.76), we may obtain a 
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nontrivial solution A u B u of this algebraic system. With these values A l9 B x we obtain 
the nontrivial solution 


x = A { e Xlt , 
y = J 3 { e Xl \ 

of the given system (7.74). 

Three cases must now be considered: 

1. The roots and k 2 are real and distinct. 

2. The roots X x and X 2 are conjugate complex. 

3. The roots and X 2 are real and equal. 


B. Case 1. The Roots of the Characteristic Equations (7.78) 
are Real and Distinct 

If the roots X r and X 2 of the characteristic equation (7.78) are real and distinct, it 
appears that we should expect two distinct solutions of the form (7.75), one 
corresponding to each of the two distinct roots. This is indeed the case. Furthermore, 
these two distinct solutions are linearly independent. We summarize this case in the 
following theorem. 


THEOREM 7.7 


Hypothesis. The roots X x and X 2 , of the characteristic equation (7.78) associated with 
the system (7.74) are real and distinct. 


Conclusion. 

the form 


The system (7.74) has two nontrivial linearly independent solutions of 


x = A x e Xl \ 

and 

y = B t e Xl \ 


x = A 2 e X2 \ 

y = B 2 e^‘, 


where A U B U A 2 ,and B 2 are definite constants. The general solution of the system (7.74) 
may thus be written 

x = c 1 A l e Xlt + c 2 A 2 e Xl \ 
y = c l B l e Xlt + c 2 B 2 e Xl \ 


where c x and c 2 are arbitrary constants. 


► Example 7.17 


^ = 6x - 3 y, 
dt 


dy 

dt 


= 2x + y. 


(7.79) 
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We assume a solution of the form (7.75): 

x = Ae x \ 
y = Be Xt . 

Substituting (7.80) into (7.79) we obtain 

AXe Xt = 6 Ae Xt — 3Be x \ 

BXe Xt = 2 Ae Xt + Be x \ 

and this leads at once to the algebraic system 

(6 - X)A - 3B = 0, 

2A + (1 - X)B = 0, 

in the unknown A. For nontrivial solutions of this system we must have 


(7.80) 


(7.81) 


6-A -3 

2 1 - X 


= 0 . 


Expanding this we obtain the characteristic equation 

X 2 -7/1 + 12 = 0. 

Solving this, we find the roots X t = 3, A 2 = 4. 

Setting 1 = Xi = 3 in (7.81), we obtain 

3A - 3B = 0, 

2A-2B = 0. 


A simple nontrivial solution of this system is obviously A — B — 1. With these values 
of A , B , and A we find the nontrivial solution 


x — e 


y = e 2t . 


(7.82) 


We note that a different solution for X and B here (for instance, A = B = 2) would only 
lead to a solution which is linearly dependent of (and generally less simple than) 
solution (7.82). 

Now setting X = X 2 = 4 in (7.81), we find 


2A — 3B = 0, 

2A-3B = 0. 

A simple nontrivial solution of this system is X = 3, B = 2. Using these values of A , £, 
and A we find the nontrivial solution 


x = 3e 4t , 
y = 2e 4f . 


(7.83) 


By Theorem 7.7 the solutions (7.82) and (7.83) are linearly independent (one may check 
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this using Theorem 7.4) and the general solution of the system (7.79) may be written 

x = c t e 3t + 3c 2 e 4t , 
y = c t e 3t + 2c 2 e 4t , 

where c 1 and c 2 are arbitrary constants. 


C. Case 2. The Roots of the Characteristic Equation (7.78) 
are Conjugate Complex 


If the roots and X 2 of the characteristic equation (7.78) are the conjugate complex 
numbers a + bi and a — hi , then we still obtain two distinct solutions 

x = A*e ia + bi)t , x = A*e {a ~ bi)t , 

and (7.84) 

y = BXe {a + bi)t , y = B$e ia ~ bi)t , 

of the form (7.75), one corresponding to each of the complex roots. However, the 
solutions (7.84) are complex solutions. In order to obtain real solutions in this case we 
consider the first of the two solutions (7.84) and proceed as follows: We first express the 
complex constants A\ and in this solution in the forms A^ = A x + \A 2 and 
J5* = B x + iB 2 , where A l9 A 2 , B l9 and B 2 are real. We then apply Euler’s formula 
e ie = cos 6 + i sin 6 and express the first solution (7.84) in the form 

x = (A t + iA 2 )e at (cos bt + i sin bt), 

y = (B { + iB 2 )e at (cos bt + i sin bt). 

Rewriting this, we have 

x = e at [{A i cos bt - A 2 sin bt) + i(A 2 cos bt + A { sin bt)'], 

y = e at [{B l cos bt - B 2 sin bt) + i(B 2 cos bt + B { sin bt)]. (7.85) 

It can be shown that a pair lf { (t) + if 2 (t ), g x (t) + ig 2 (t)] of complex functions is a 
solution of the system (7.74) if and only if both the pair [_fi(t) 9 g x (t)] consisting of their 
real parts and the pair [f 2 (t), g 2 (t)] consisting of their imaginary parts are solutions of 
(7.74). Thus both the real part 


x = e at (A l cos bt — A 2 sin bt\ 
y = e at (B l cos bt — B 2 sin bt), 

and the imaginary part 

x = e at (A 2 cos bt + Ai sin bt), 
y = e at (B 2 cos bt + B x sin bt), 


(7.86) 


(7.87) 


of the solution (7.85) of the system (7.74) are also solutions of (7.74). Furthermore, the 
solutions (7.86) and (7.87) are linearly independent. We verify this by evaluating the 
Wronskian determinant (7.73) for these solutions. We find 

e at (A 1 cos bt — A 2 sin bt) e at (A 2 cos bt + A x sin bt) 
e at (B x cos bt — B 2 sin bt) e at (B 2 cos bt + B { sin bt) 

= e 2 a \A,B 2 - A.B,). 


(7.88) 
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Now, the constant Bf is a nonreal multiple of the constant Af. If we assume that 
A i B 1 — A 1 B l = 0, then it follows that Bf is a real multiple of A which contradicts 
the result stated in the previous sentence. Thus A l B 2 — A 2 B l ^0 and so the 
Wronskian determinant W(t) in (7.88) is unequal to zero. Thus by Theorem 7.4 the 
solutions (7.86) and (7.87) are indeed linearly independent. Hence a linear combination 
of these two real solutions provides the general solution of the system (7.74) in this case. 
There is no need to consider the second of the two solutions (7.84). We summarize the 
above results in the following theorem: 


THEOREM 7.8 

Hypothesis. The roots X Y and X 2 of the characteristic equation (7.78) associated with 
the system (7.74) are the conjugate complex numbers a + bi. 

Conclusion. The system (7.74) has two real linearly independent solutions of the 
form 

x = e at (A x cos bt — A 2 sin bt ), x = e at (A 2 cos bt + A { sin bt), 

and 

y = e at (B { cos bt - B 2 sin bt), y = e at (B 2 cos bt + B t sin bt), 

where A u A 2 ,B i , and B 2 are definite real constants. The general solution of the system 

(7.74) may thus be written 

x = e at [c l (A l cos bt - A 2 sin bt) + c 2 (A 2 cos bt + A l sin bt)], 

y = e at [c l (B 1 cos bt — B 2 sin bt) + c 2 (B 2 cos bt + B { sin bt)], 

where c { and c 2 are arbitrary constants. 


► Example 7.18 


dx 

— = 3x + 2 y, 
dt 

d y , , 

-=-5x + ,. 

We assume a solution of the form (7.75): 

x = Ae Xl , 

>> = Be Xt . 

Substituting (7.90) into (7.89) we obtain 

AXe x ' = 3Ae Xl + 2 Be x ‘, 
BXe x, = -5Ae Xl + Be x \ 
and this leads at once to the algebraic system 

(3 - X)A + IB = 0, 
-5A + (1 -X)B = 0, 


(7.89) 


(7.90) 


(7.91) 
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in the unknown X. For nontrivial solutions of this system we must have 

3 -" 2 = 0 . 

-5 1 - X 

Expanding this, we obtain the characteristic equation 

X 2 -4X+ 13 = 0. 

The roots of this equation are the conjugate complex numbers 2 ± 3 i. 

Setting X = 2 + 3i in (7.91), we obtain 

(1 -3 i)A + 2J3 = 0, 

— 5/4 + ( — 1 — 3i)B = 0. 

A simple nontrivial solution of this system is A = 2, B = — 1 + 3i. Using these values 
we obtain the complex solution 

x = 2e (2 + 3i)t , 
y = (- 1 + 3i)e (2 + 3t)t , 

of the given system (7.89). Using Euler’s formula this takes the form 

x = e 2t [(2 cos 3 1) + i(2 sin 3 1)], 

y = e 2t [( — cos 3r — 3 sin 3 1) + i( 3 cos 3 1 — sin 3 1)~\. 

Since both the real and imaginary parts of this solution of system (7.89) are themselves 
solutions of (7.89), we thus obtain the two real solutions 

x = 2e 2t cos 3 1, 

(7.92) 


and 


y = —e 2t (cos 3t + 3 sin 3 1), 

x = 2e 2t sin 3 1, 
y = e 2t (3 cos 3 1 — sin 3 1). 


(7.93) 


Finally, since the two solutions (7.92) and (7.93) are linearly independent we may write 
the general solution of the system (7.89) in the form 

x = 2e 2t (c i cos 3 1 + c 2 sin 3t), 

y = c 2f [ci( — cos 3t — 3 sin 3 1) + c 2 (3 cos 3 1 — sin 3 £)], 

where c { and c 2 are arbitrary constants. 


D. Case 3. The Roots of the Characteristic Equation (7.78) 
are Real and Equal 

If the two roots of the characteristic equation (7.78) are real and equal, it would appear 
that we could find only one solution of the form (7.75). Except in the special subcase in 
which aj = b 2 # 0, a 2 = b x = 0 (see Exercise 33 at the end of this section) this is indeed 
true. In general, how shall we then proceed to find a second, linearly independent 
solution? Recall the analogous situation in which the auxiliary equation corresponding 
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to a single nth-order linear equation has a double root. This would lead us to expect a 
second solution of the form 


x = Ate Xx , 
y = Bte Xt . 


However, the situation here is not quite so simple (see Exercise 34 at the end of this 
section). We must actually seek a second solution of the form 


x = (A x t + A 2 )e Xt , 
V = (B x t + B 2 )e» 


(7.94) 


We shall illustrate this in Example 7.19. We first summarize Case 3 in the following 
theorem. 


THEOREM 7.9 

Hypothesis. The roots and X 2 of the characteristic equation (7.78) associated with 
the system (7.74) are real and equal. Let X denote their common value. Further assume that 
system (7.74) is not such that a { = b 2 / 0, a 2 = b x = 0. 

Conclusion. The system (7.74) has two linearly independent solutions of the form 

x = Ae Xt , x = (A x t + A 2 )e Xt , 

and 

y = Be kt , y = (B, t + B 2 )e Xt , 

where A, B, A l9 A 2 , B t , and B 2 are definite constants , and are not both zero , and 

Bi/Ai = B/A. The general solution of the system (7.74) may thus be written 

x = c t Ae Xt -f c 2 (A t t + A 2 )e x \ 

y = c x Be Xt + c 2 (Bit + B 2 )e Xt , 

where c { and c 2 are arbitrary constants. 

► Example 7.19 


dx 

dy -> 

J,~ x + 2y 

We assume a solution of the form (7.75): 

x — Ae M , 
y = Be*'. 

Substituting (7.96) into (7.95) we obtain 

AXe M = 4Ae*‘ - Be*', 
BXe *' = Ae M + 2 Be*', 


(7.95) 


(7.96) 



308 SYSTEMS OF LINEAR DIFFERENTIAL EQUATIONS 


and this leads at once to the algebraic system 

(4 - X)A - B = 0, 

A + (2 - X)B = 0, 

in the unknown X. For nontrivial solutions of this system we must have 


Expanding this we obtain the characteristic equation 

X 2 - 6X + 9 = 0 


(7.97) 


(7.98) 


or 


(X - 3) 2 = 0. 

Thus the characteristic equation (7.98) has the real and equal roots 3, 3. 
Setting X = 3 in (7.97), we obtain 

A - B = 0, 

A - B = 0. 


A simple nontrivial solution of this system being A = B = 1, we obtain the nontrivial 
solution 


x = e 


3 1 


y = e 3t , 


(7.99) 


of the given system (7.95). 

Since the roots of the characteristic equation are both equal to 3, we must seek a 
second solution of the form (7.94), with X = 3. That is, we must determine A lf A 2 , B u 
and B 2 (with A x and B l not both zero) such that 


x = (A t t + A 2 )e 3t , 
y = (B lt + B 2 )e 3 ‘, 

is a solution of the system (7.95). Substituting (7.100) into (7.95), we obtain 
(3/1 jt + 3A 2 + = 4(/4jt + A 2 )e 31 — (B±t + B 2 )e 3 ', 

(3 B^ + 3 B 2 + B t )e 31 = (A { t + A 2 )e 3 ' + 2(B,r + B 2 )e 3t . 

These equations reduce at once to 

(A i -B 1 )t + (A 2 -A 1 1 -B 2 ) = 0, 

(A 1 -B 1 )t + (A 2 -B l -B 2 ) = 0. 

In order for these equations to be identities, we must have 

A\ — Bj = 0, A 2 A i B 2 = 0, 

A 1 -B 1 = 0, A 2 -B l -B 2 = 0. 


(7.100) 


(7.101) 


Thus in order for (7.100) to be a solution of the system (7.95), the constants A t , A 2 , 
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B u and B 2 must be chosen to satisfy the equations (7.101). From the equations 
A\ — B i = 0, we see that A 1 = B x . The other two equations of (7.101) show that A 2 
and B 2 must satisfy 

A 2 -B 2 = A l = B l . (7.102) 

We may choose any convenient nonzero values for A t and B t . We choose A = B x = 1. 
Then (7.102) reduces to A 2 — B 2 = 1, and we can choose any convenient values for A 2 
and B 2 that will satisfy this equation. We choose A 2 = 1, B 2 = 0. We are thus led to the 
solution 


x = (t + l)e 3t , 


By Theorem 7.9 the solutions (7.99) and (7.103) are linearly independent. We may thus 
write the general solution of the system (7.95) in the form 

x = c { e 3t + c 2 (t + l)e 3t , 
y = c t e 3t + c 2 te 3 \ 

where c { and c 2 are arbitrary constants. 

We note that a different choice of nonzero values for A { and B { in (7.102) and/or a 
different choice for A 2 and B 2 in the resulting equation for A 2 and B 2 will lead to a 
second solution which is different from solution (7.103). But this different second 
solution will also be linearly independent of the basic solution (7.99), and hence could 
serve along with (7.99) as one of the two constituent parts of a general solution. For 
example, if we choose A t = B { = 2 in (7. 102), then (7. 102) reduces to A 2 — B 2 = 2; and 
if we then choose A 2 = 3, B 2 = 1, we are led to the different second solution 

x = (2 1 + 3)e 3t , 
y = (2t + l)e 3t . 

This is linearly independent of solution (7.99) and could serve along with (7.99) as one 
of the two constituent parts of a general solution. 


Exercises 

Find the general solution of each of the linear systems in Exercises 1-22. 


dx 

~ dx 

-di =5x ~ iy - 

1 Tt = 5x ~ y ’ 

1 

X 

II 

5 - 1 ^- 

II 

X 

+ 

dx 

dx 

*- x+2> ' 

4. — = 2x + 3y, 

^-3* + 2 
dt 

dy . 

j, = - x ~ 2> - 
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5. 

dx 

6. 

dx 

dt 


d -f = * X + 3y. 
dt 


dy 
dt ‘ 

7. 


8. 

dx 

It 




dy 
dt ' 

9. 

dx 

^-* + 3,, 

10. 

dx 

It 


dy o 


dy 

dt 

11. 

dx 

A**- 4 * 

12. 

dx 

It 


dy 

i;= x + y - 


dy 

dt 

13. 

dx 

ir x - ,y - 

14. 

dx 

It 


jr ix + y - 


dy 

dt 

15. 

!-*-* 

16. 

dx 

dt 


^ = 5x + 2y. 
rft 


dy 

dt 

17. 

dx * * 

— = 3x - 2y, 

18. 

dx 

~dt 


^j- = 2x + 3y. 
dt 


dy 

dt 

19. 

dx 

it =lx - r ' 

20. 

dx 

It 


dy 

T,= ix ~ y 


dy 

dt 

21. 

dx 

T,- ix + iy 

22. 

dx 

It 


dy 

— = -x + y. 


dy 

dt 


6x — y, 

3x + 2 y. 
2x — y, 
9x + 2 y. 
3x + 2y, 
6x — y. 
lx — 3y, 
3x + 2y. 
5x — 4j;, 
2x + y. 
x - 5 y, 
lx — y. 
6x — 5 y, 
x + 2 y. 

■■ lx + Ay, 
-x + 3 y. 
■ x — 2y, 
2x — 3 y. 
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In each of Exercises 23-28, find the particular solution of the linear system that satisfies 
the stated initial conditions. 


dx 

¥=- 2 * +7 * 

24. 

dx 

— = — 2x + y, 
dt 

d i-3x + 2y, 
dt 


dy 

i- n + iy 

x(0) = 9, y(0) = — 1. 


x(0) = 6, y(0) = 2. 

£ = 2 x - 8 ,. 

26. 

dx 

— = 3x + 5y, 
dt 

g-* + 6y. 


dy 

j,=- 2x+5y 

x(0) = 4, y(0) = 1. 


x(0| = 5. WO) - -1 

dx 

_ = 6x- 4 y 

28. 

dx 

Tt= lx ~ y 

dy 

i~ x+ly ' 



xfO) - 2, vlOl - 3. 


x(0) = 1, y(0) = 3. 


29. Consider the linear system 

dx , 

t— - a t x + b t y, 
dt 

dy 

t- = a 2 x + b 2 y, 

where a u b u a 2 , and b 2 are real constants. Show that the transformation t = e w 
transforms this system into a linear system with constant coefficients. 

30. Use the result of Exercise 29 to solve the system 

dx 

t- = x + y, 

t = — 3x + 5y. 
dt 

31. Use the result of Exercise 29 to solve the system 

dx 

( - = 2x + 3y, 

t^f = 2x + y. 
dt 
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32. Consider the linear system 

dx 

- = ai x + b iy , 

dy 

— = a 2 x + b 2 y, 

where a l ,b l , a 2 , and b 2 are real constants. Show that the condition a 2 b { > 0 is 
sufficient, but not necessary, for the system to have two real linearly independent 
solutions of the form 

x = Ae Xt , y = Be Xt . 

33. Suppose that the roots of the characteristic equation (7.78) of the system (7.74) are 
real and equal; and let X denote their common value. Also assume that the system 
(7.74) is such that a Y = b 2 # 0 and a 2 = b { = 0. Show that in this special sub- 
case there exist two linearly independent solutions of the form (7.75). 

34. Suppose that the roots of the characteristic equation (7.78) of the system (7.74) are 
real and equal; and let X denote their common value. Also assume that the system 
(7.74) is not such that = b 2 / 0 and a 2 = b { = 0. Then show that there exists no 
nontrivial solution of the form 

x = Ate Xt , y = Bte Xt , 

which is linearly independent of the “basic” solution of the form (7.75). 

35. Referring to the conclusion of Theorem 7.9, show that BJA x = B/A in the case 
under consideration. 


7.5 MATRICES AND VECTORS 

A. The Most Basic Concepts 

The study of matrices and vectors is a large and important subject, and an entire 
chapter the size of this one would be needed to present all of the most fundamental 
concepts and results. Therefore, after defining a matrix, we shall introduce only those 
very special concepts and results that we shall need and use in this book. For the reader 
familiar with matrices and vectors, this section will be a very simple review of a few 
select topics. On the other hand, for the reader unfamiliar with matrices and vectors, the 
detailed treatment here will provide just what is needed for an understanding of the 
material presented in this book. 

DEFINITIONS 

A matrix is defined to be a rectangular array 

( f a 11 a \2 

a 2l a 22 

a ml a m2 
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of elements a {j (i = 1,2,..., m; j = 1,2,..., n), arranged in m ( horizontal ) rows and m 
(vertical) columns. Tfee matrix is denoted by the boldface letter A, as indicated ; and tfee 
element in its ith row and jth column , fey a Vp as suggested. We write A = (a^), and ca// A 
an m x n matrix. 

We shall be concerned with the two following special sizes of matrices. 

1. A square matrix is a matrix for which the number of rows is the same as the 
number of columns. If the common number of rows and columns is n, we call the 
matrix an n x n square matrix. We write 

a n a l2 
a 2i a 22 

a „i a n2 

2. A vector (or column vector) is a matrix having just one column. If the vector has n 
rows (and, of course, one column), we call it an n x 1 vector (or n x 1 column vector). 
We write 

*i 
*2 

The elements of a matrix (and hence, in particular, of a vector) may be real numbers, 
real functions, real function values, or simply “variables.” We usually denote square 
matrices by boldface Roman or Greek capital letters and vectors by boldface Roman or 
Greek lowercase letters. 

Let us give a few specific examples. 

The matrix 

1 3 6 l\ 

. -2 0 4-5 

7 5-3 2 

4 -1 3 -6/ 

is a 4 x 4 square matrix of real numbers; whereas defined by 

1C t + 1 5 \ 

O(t) = t t 2 3f 

0 2t-lJ 

is a 3 x 3 square matrix of real functions defined for all real t. 

The vector 

3 
1 
2 
5 
2 





c = 
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is a 5 x 1 column vector of real numbers; the vector <J> defined by 


l 


4*0 = 

is a 3 x 1 column vector of real functions defined for all real t\ and the vector 


e 
te l 

2e f + 1 


f xA 

x 2 

X = 

*3 

\ x *l 

is a 4 x 1 column vector in the four variables x l9 x 2 , x 3 , x 4 . 

The elements of a vector are usually called its components. Given an n x 1 column 
vector, the element in the ith row, for each i = 1, 2 ,..., n , is then called its ith 
component. 

For example, the third component of the column vector c illustrated above is 2, and 
its fourth component is 5. 

For any given positive integer n , the n x 1 column vector with all components equal 
to zero is called the zero vector and is denoted by 0. Thus if n = 4, we have the 4 x 1 
zero vector 


/°\ 

0 

0 

w 


DEFINITION 

We say that two m x n matrices A = (a^) and B = (by) are equal if and only if each 
element of one is equal to the corresponding element of the other. That is, A and B are 
equal if and only if a u = b u fori = 1,2 ,...,m and j = 1, 2, . . . , n. If A and B are equal, we 
write A = B. 


Thus, for example, the two 3x3 square matrices 



f <*n 

<*12 

«13\ 



6 

5 


A = ( 

<*21 

<*22 

<*23 

| and 

B = 

-1 

0 

8 


^<*3 1 

<*32 

<*3 3y j 



l 0 

-2 

-v 

are equal if and only 

if 








<*n =6, a 12 = 5, 

a 

13 = 

7, 

<*21 = 

-1, 

a 22 

= 0, 



<*23 = 8 , a 31 = 0, a 32 = -2, and a 33 = -A. 


We then write A = B. 



7.5 MATRICES AND VECTORS 315 


Likewise, the vectors 



lx A 



X = 

x 2 

and c = 

7 


*3 


2 


N 


1-6/ 


are equal if and only if 

= — 3, x 2 = 7, x 3 = 2 , x 4 = 

We then write x = c. 


- 6 . 


DEFINITION Addition of Matrices 


The sum of two m x n matrices A = ( a y ) a/id B = is defined to be them x n matrix 
C = ( Cu ), where c i} — a ^ + bi^for i = 1, 2, . . . , m and j = 1,2,..., n. We write C = A + B. 

We may describe the addition of matrices by saying that the sum of two m x n 
matrices is the m x n matrix obtained by adding element-by-element. 

Thus, for example, the sum of the two 3x3 square matrices 


/l 

4 

" 5 1 

/-5 

4 

°\ 

6 

2 

0 

and B = 1 

-1 

3 


8 


\ 6 

2 

V 


h- 5 

4 + 4 

— 5 + 0^ 


—4 

8 

~ 5 \ 

6 + 1 

2- 1 

0 + 3 

= 

7 

1 

3 

\9 + 6 

8 + 2 

3 + 7) 



10 

10 


is the 3 x 3 square matrix 


C = 


We write C = A + B. 

Likewise, the sum of the vectors 


x = 


DEFINITION Multiplication by a Number 

The product of the m x n matrix A = (a y ) and the number c is defined to be them x n 
matrix B = where = ca y for i = 1,2 ,...,m and j = 1,2,..., n. We write B = cA. 

We may describe the multiplication of an m x n matrix A by a number c by saying 
that the product so formed is the m x n matrix that results from multiplying each 
individual element of A by the number c . 

Thus, for example, the product of the 3 x 3 square matrix 


l Xl ) 


t”‘\ 


fxi + yA 

*2 

and y = 


is x + y = 

' 

x 2 + y 2 

x 3 


y, 

x 3 + y 3 

V‘l 


V‘l 


^4 + y* j 
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by the number 3 is the 3x3 square matrix 



( 3-2 3 • (—3) 3-6 \ 


1 6 

-9 

18\ 

B = 

3 * ( — 7) 3-4 3 • ( — 1) 


-21 

12 

-3 


i 3-0 3-5 3-2 J 


l 0 

15 

6 


We write B = 3A. 

Likewise, the product of the vector 

l x i 

x= 

*3 

w 

by the number 5 is the vector 


/5x,\ 



\ 5x */ 


We write y = 5x. 

DEFINITION 

Letx l ,x 2 ,...,x m bernn x 1 vectors, and let c l9 c 2 , . . . , c m be m numbers. Then an element 
of the form 

CiXi + c 2 x 2 + — + c m x m 

is an n x 1 vector called a linear combination of the vectors x 1? x 2 , . . . , x m . 

For example, consider the four 3 x 1 vectors 



and the four real numbers 2, 4, 5, and — 3. Then 



/ 4+12 + 5-12 \ / 9\ 

= -2 + 8-15-15 = -24 
^6 + 4—10 + 0 J \ 0 j 

is a linear combination of x l9 x 2 , x 3 , and x 4 . 
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DEFINITION 

Let 


1 

/* U 

a l2 

'■ “ln\ 


l x A 

A = 

’ fl 21 

1 ! 

a 22 

a 2n 

and x = 

x 2 


\ a nl a n2 

" a nnj 


\ x -l 


be ann x n square matrix and ann x 1 vector , respectively. Then the product Ax of the 
n x n matrix A by the n x 1 vector x is defined to be the n x 1 vector 


Ax = 


/« 11*1 + fll 2*2 + ••• + 

a 2l x l + fl 22 x 2 + " ' + ®2n x n 


\a nl x i + a n2 x 2 + ••• + a nn x„J 
Note that Ax is a vector. If we denote it by y and write 

y = Ax, 


where 


y = 


/M 

y 2 

♦ 5 

w 


then we have 


Ti = + ^12*2 + ••• + 

y 2 = a 2 iX! + fl 22 x 2 + ••• + a 2 „x„, 


= a„ 1X1 + a„ 2 x 2 + ••• + a„„x„; 
that is, in general, for each i = 1,2 


For example, if 


Ti = a, 1*1 + ®i 2 x 2 + ••• + a in x„ = £ a y x ; . 

1 


A = 


2 

-4 

7\ 


( x i\ 

5 

3 

-8 

and x = 

X 2 

3 

6 



i X 3 


then 

( 2x ! — 4 x 2 + 7x 3 
5xj + 3x 2 — 8x 3 |. 

— 3x t + 6x 2 + x 3 j 

Before introducing the next concept, we state and illustrate two useful results, leaving 
their proofs to the reader (see Exercise 7 at the end of this section). 
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Result A. If A is ann x n square matrix and x and y are n x 1 column vectors , then 

A(x + y) = Ax + Ay. 

Result B. If A is an n x n square matrix , x is an n x 1 column vector , and c is a 
number , then 

A(cx) = c(Ax). 


► Example 7.20 


We illustrate Results A and B using the matrix 


A = -1 
3 


-2 5 


the vectors 


and y = y 2 , 


and the number c = 3. 

Illustrating Result A, we have 

/ 2 1 3 \lx l + yA 

A(x + y) = — 1 4 1 x 2 + y 2 

\ 3 -2 5)\x 3 + y 3 ] 

1 2(x, + yi) + (x 2 + y 2 ) + 3(x 3 + y 3 ) \ 
= -(xi + yi) + 4 (x 2 + y 2 ) + (x 3 + y 3 ) 
\3(X! + y t ) - 2(x 2 + y 2 ) + 5(x 3 + y 3 ) ) 

I 2x t + x 2 + 3x 3 \ / 2y l + y 2 + 3 y 3 

= — x t + 4x 2 + x 3 + —y t + 4 y 2 + y 3 
\3x l -2x 2 + 5x 3 \3>’ 1 — 2y 2 + 5y 3 


2 1 3\ 

-1 4 1 

3 -2 5, 


•(■! 


1 3 \lyC 
4 1 y 2 
-2 5 \y 3 


Illustrating Result B, we have 


= Ax + Ay. 


I 2x t + x 2 + 3x 3 
= 3 -X! + 4x 2 + x 3 
\3x i — 2x 2 + 5x 3 


I 6x t + 3 *x 2 + 9x 3 
| = -3x t + 12x 2 -f 3 x 3 
y 9x t — 6x 2 + 15x 3 

= c(Ax). 
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We have seen examples of vectors whose components are numbers and also of 
vectors whose components are real functions defined on an interval [a, ft]. We now 
distinguish between these two types by means of the following definitions: 


DEFINITIONS 

L A vector all of whose components are numbers is called a constant vector. 

2. A vector all of whose components are real functions defined on an interval [a, ft] is 
called a vector function. 


DEFINITIONS 


Let <|> be the n x 1 vector function defined by 

4i(0\ 


4>(t) = 




for all t on a real interval [a, ft]. 

(1) Suppose the components ^(f), </> 2 (t), . . . , are differentiable on [a, ft]. Then the 

derivative of f> is the vector function defined by 

/#iW\ 


dm 

dt 


dt 

d<p 2 (t) 


dt 


d<l>„{t) 

\~dTl 


for all t e [a, b~\. 

(2) Suppose the components (fr^t), (j> 2 (t ), . . . , (f n (t ) are integrable on [a, ft]; and let t 0 and 
t e [a, ft], where t 0 < t. Then the integral of </> from t 0 to t is the vector function defined by 


I' 


<|>(u) du = 


(fifu) du 


(j> 2 (u) du 


4>„(u) du 


Thus the derivative of a given vector function all of whose components are 
differentiable is the vector function obtained from the given vector function by 
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differentiating each component of the given vector function. Likewise, the integral from 
t 0 to t of a given vector function, all of whose components are integrable on the given 
interval, is the vector function obtained from the given vector function by integrating 
each component of the given vector function from t 0 to t. 


► Example 7.21 


The derivative of the vector function 4> defined for all t by 

4 1 3 \ 

(|)(f) = I 2t 2 T 3 1 


2e 3 


I 


is the vector function defined for all t by 

dm 


dt 


lit 2 \ 
4t + 3 

/ 


Also, the integral of <|> from 0 to t is the vector function defined for all t by 



[\uUu \ 


\ 


Jo 



<\>(u)du = 

0 

j* (2u 2 + 3 u) du 

= 

it 3 +it 2 


\ 1 


ii(e 3 ' - 1) 


Exercises 

1. In each case, find the sum A + B of the given matrices. 


« 3 - -U 1> 



( 2 

1 



I 7 

-1 

6 \ 

(b) A = 

-1 

0 

5 

and B = 


4 

-3 


I" 4 

3 

- 2 i 


1^5 

-5 

V 


h 5 

0 



1 7 

-2 

— 3\ 

(c) A= -2 

-1 

-3 ’ 

and B = 

! 6 

-3 

1 

l 6 

2 

5 i 


l- 2 

1 

-3J 


2. In each case, find the product cA of the given matrix A and the number c. 

« a -(, _j)- c=3 - 
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1 1 

-3 

5\ 

(b) 

A = 

1 6 

-2 

0 , c = -4. 



I" 3 

1 

’/ 



1 5 - 

1 

2 \ 

(c) 

A = 1 

4 - 

3 

-2, c=-3. 


\0 3-6 J 

3. In each case, find the indicated linear combination of the given vectors 







l 2 \ 


H\ 


2 


1 3 


'4' 

, and x 4 = 

I 2 

Xl = 

-1 

, x 2 = 

i 5 

, x 3 = 

0 

1“ 3 


4 i 


1 - 2 i 


l 6 i 

’ 

\ 5 I 


(a) 2xj + 3x 2 — x 3 . 

(b) 3x t — 2x 2 + 4x 4 . 

(c) — Xj + 5x 2 — 2x 3 + 3x 4 . 

4. In each case, find the product Ax of the given matrix A by the given vector x. 



x = 




1-3 

-5 

7 \ 

/ 

A 


(b) 

A = 

0 

4 

1 . x = 


3 




\- 2 

1 

3 i 


- 2 1 




( 1 

0 

_3 \ 


Ui + x 2 \ 

(c) 

A = 

' 2 

-5 

4 1, x = 

= 

Xi 

+ 2x 2 



i- 3 

1 

2 / 


^2 

: — *3 / 


5. Illustrate Results A and B of this subsection using the matrix 


the vectors 


and the number c = 4. 


/ 3-1 2\ 


A= 5 4 

-3 

I " 5 

2 I 


I X1 \ 1 

fy A 

X = 

x 2 and y = 

y 2 


W 

V 2 ! 


6 . 


In each case, find (i) the derivative and (ii) the integral from 0 to t of the vector 
function c|> that is defined. 


(a) 


4>(0 = 


5t 2 \ 

— 6 1 3 + t 2 . 

y It 2 — 5 1 J 
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7. 


(b) m 

(c) <J)(0 


/ * 3 ' \ 
(2 1 + 3)e 3 ' 

\ 


/ sin 3 1 \ 
cos 3 1 
t sin 3 1 
y cos 3 / 


Prove Results A and B of the text (page 318). 


B. Matrix Multiplication and Inversion 
DEFINITION Multiplication of Matrices 

* Let A = (a^) be an m x n matrix and B = (b fj ) be an n x p matrix . The product AB is 
defined to be the m x p matrix C = (c 0 ), where 

n 

Cij = a n b k j + a , -2 A + • • • + a in b„j = I a ik b kj (7.104) 

k = 1 

/or all i = 1, 2, . . . , m and allj= 1, 2, . . . , p. We write C = AB. 

Concerning this definition, we first note that A has m rows and n columns, B has n 
rows and p columns, and the product C = AB has m rows (the same number as A has) 
and p columns (the same number as B has). Next observe that the number of columns in 
A is equal to the number of rows in B. Matrix multiplication is defined only when this is 
the case, that is, only when the number of columns in the first matrix is the same as the 
number of rows in the second. Then the element c 0 in the ith row and j th column of the 
product matrix C = AB is the sum of the n products that are obtained by multiplying 
each element in the ith row of A by the corresponding element in the j th column of B. 


► Example 7.22 


Let 


A = 



a 


and 




Matrix A is a 3 x 2 matrix (3 rows, 2 columns), and matrix Bis a 2 x 3 matrix (2 rows, 3 
columns). Thus the number of columns in A is equal to the number of rows in B, and so 
AB is defined. Also, the number of columns in B is equal to the number of rows in A, so 
BA is defined. Thus, for these two matrices A and B, both of the products AB and BA 
are defined. 

We first find the 3 x 3 matrix AB: 


2 

-1 

\ 4 


i\ 

3 


4 2 
-1 3 



AB = 



7.5 
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/ (2)(4) + (1)(— 1) (2)(2) + (1)(3) (2)(— 2) + (1)(0)\ 

= (-l)(4) + (3)(-l) ( — 1)(2) + (3)(3) ( — 1)( — 2) + (3)(0) 

\ (4)(4) + (5)(— 1) (4)(2) + (5)(3) (4)(-2) + (5)(0)/ 

/ 7 7-4 

= -7 7 2 

\ 11 23 -8 




For the matrices A and B of Example 7.22, we note that although the products AB 
and BA are both defined, they are not of the same size and hence are necessarily not 
equal. Now suppose two matrices A and B are such that AB and BA are both defined 
and of the same size. (Note that this will always be the case when A and B themselves 
are both n x n square matrices of the same size.) We are led to inquire whether or not 
AB and BA are necessarily equal in such a case. The following example shows that they 
are not: 


► Example 7.23 


/ 2 

1 

-i\ 


3 

-2\ 

4 

3 

— 2 and B = 

0 

-1 

3 

^-6 

2 

5 I 

l 2 

1 

- 1 / 


We observe that A, B, AB, and BA are all 3 x 3 matrices. We find 

I 2 -l\/l 3 — 2\ 

AB = 4 3 -2 4 -1 3 

y — 6 2 5]\2 1 - l y 

I (2)(l) + (l)(4) + (— 1)(2) (2)(3) + (1)( — 1) + ( — 1)(1) 

= (4)(1) + (3)(4) + ( — 2)(2) (4)(3) + (3)(- 1) + (-2)(1) 

\(— 6)(1) + (2)(4) + (5)(2) ( — 6)(3) + (2)( — 1) + (5)(1) 

(2)(— 2) + (1)(3) + (— 1)(— 1)\ 
(4)(— 2) + (3)(3) + (— 2)(— 1) 
(_6)(-2) + (2)(3)+ (5)(-l)/ 

4 4 0\ 

= 12 7 3 

yl2 -15 13/ 
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and 

/I 3 — 2\ / 2 1 -1\ 

BA= 4 -1 3 4 3 -2 

^2 1 -l/\-6 2 5/ 

/ (1)(2) + (3)(4) + (-2)(-6) (1)(1)+ (3)(3) + (— 2)(2) 

= (4)(2) + (-l)(4)+ (3)( — 6) (4)(1) + ( — 1)(3) + (3)(2) 

\(2)(2)+ (1)(4) + ( — 1)( — 6) (2)(1)+ (1)(3) + (— 1)(2) 

(1) (-1)+ (3)(— 2) + ( — 2)(5)\ 

(4)(— 1) + (— 1 )( — 2) + (3)(5) 

(2) (— 1) + (l)(-2) + (-l)(5)/ 

26 6 -17\ 

= -14 7 13 . 

\ 14 3 ~ 9 I 

Thus, even though AB and BA are both of the same size, they are not equal. We write 
AB/BA in this case. 

The preceding example clearly illustrates the fact that matrix multiplication is not 
commutative. That is, we do not have AB = BA in general, even when A and B and the 
two products AB and BA are all n x n square matrices of the same size. However, we 
point out that matrix multiplication is associative , 

(AB)C = A(BC), 

and distributive, 

A(B + C) = AB + AC, 

for any three matrices A, B, C for which the operations involved are defined. 

Now consider annx« square matrix A = (a 0 ). The principal diagonal of A is the 
diagonal of elements from the upper left corner to the lower right corner of A; and the 
diagonal elements of A are the set of elements that lie along this principal diagonal, that 
is, the elements a x x , a 12 > • • • , a nn • Now, for any given positive integer n, then x n square 
matrix in which all the diagonal elements are one and all the other elements are zero is 
called the identity matrix and is denoted by I. That is, I = (a y ), where a^ = 1 for all i = j 
and a tj = 0 for i ^ j (i = 1, 2, . . . , n; j = 1,2,..., n). Thus if n = 4, the 4 x 4 identity 
matrix is 

1 1 0 0 0\ 

0100 

”0010' 

\0 0 0 1 ) 

If A is an arbitrary n x n square matrix and I is the n x n identity matrix, then it 
follows at once from the definition of matrix multiplication that 


AI = IA = A. 
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We now consider the following problem involving matrix multiplication: Given an 
n x n square matrix A, we seek another n x n square matrix B such that AB = I, where 
I is the identity matrix. In order to discuss the existence of such a matrix B, we need 
the following definition. 


DEFINITION 

Let A be an n x n square matrix. The matrix A is called nonsingular if and only if its 
determinant is unequal to zero : | A| # 0. Otherwise , the matrix A is called singular. 


► Example 7.24 

The matrix 


/ 


A = 


2 

4 

-6 


of Example 7.23 is nonsingular; for we find 
2 1 -1 

I A | = | 4 3 -2 | = 2 

-6 2 5 


-2 

5 


4 

-6 


= 4^0. 


We are now in a position to state the following basic result concerning the existence 
of a matrix B such that AB = I. 


Result C. Let A be an n x n square matrix, and let I be the n x n identity matrix. 
Then there exists a unique n x n square matrix B such that AB = BA = I if and only if A 
is nonsingular, that is, if and only if | A | ^ 0. 


DEFINITION 

Let Abe ann x n nonsingular matrix, and let I be the n x n identity matrix. The unique 
n x n matrix B such that AB = BA = I is called the inverse of A. We denote the unique 
inverse of A by A ~ 1 and thus write the defining relation between a given matrix A and its 
inverse A ~ 1 in the form 

AA 1 = A _1 A = I. (7.105) 

Now suppose that A is a nonsingular n x n matrix, so that the existence and 
uniqueness of its inverse A “ 1 is assured. We now consider the question of finding A “ 1 . 
Several distinct methods are known. We shall introduce, illustrate, and use a method 
that involves the use of determinants. Although this method is not very efficient except 
when n = 2 or n = 3, it will be sufficiently useful for our purposes in this text. In order to 
describe the procedure, several preliminary concepts will be introduced and illustrated. 
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DEFINITIONS 

Let Abe ann x n matrix, and let a V} be the element in the ith row and jth column of A. 

(1) The minor of a V] is defined to be the determinant of the (n — 1) x (n — 1) matrix 
obtained from A by deleting the ith row and jth column of A. We denote it by M \j. 

(2) The cofactor of a u is defined to be the minor M i} of a V] multiplied by the number 
(— l) l+j . We denote it by C fJ . Thus we have C tj = (— 1 ) i+j M ij . 

(3) The matrix of cofactors of the elements of A is defined to be the matrix obtained 

from A by replacing each element a^ of A by its cofactor C fJ (i = 1,2 = 1,2, 

We denote it by cof A. 

Thus the minor M tj of a^ is formally obtained simply by crossing out the row and 
column in which a tj appears and finding the determinant of the resulting matrix that 
remains. Then the cofactor C u of a y is obtained simply by multiplying the minor M fj by 
+ 1 or — 1, depending upon whether the sum i + j of the row and column of a is 
respectively even or odd. 


► Example 7.25 

Consider the 3 x 3 matrix 



We find the cofactor of the element a 23 = — 2 in the second row and third column of A. 
We first find the minor M 23 of a 23 . By definition, this is the determinant of the 2 x 2 
matrix obtained from A by deleting the second row and third column of A. Clearly this 
is 

M„ = |J ‘| -10. 

Now, by definition, the cofactor C 23 of a 23 is given by 

C 23 =(-1) 2 + 3 M 23 = -10. 

In like manner, we find the cofactors of the other elements of A. Then, replacing each 
element a u of A by its cofactor C ij9 we obtain the matrix of cofactors of A, 
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DEFINITION 

Let A = (a/j) beanm x n matrix. The transpose of A is then x m matrix B = (6 y ), where 
bjj = a jh i = 1,2 = 1,2 ,...,m. That is, the transpose of anm x n matrix A is flic 

n x m matrix obtained from A by interchanging the rows and columns of A. We denote the 
transpose of A by A r . 

For example, the transpose of the 3 x 3 matrix 



of Example 7.25 is the 3 x 3 matrix 



Note, in particular, that the transpose of the n x 1 column vector 


l x i\ 



is the 1 x n row vector 

x T = (x u x 2 ,...,x n ). 


DEFINITION 

Let A be an n x n square matrix, and let cof A be the matrix of cofactors of A. The 
adjoint of A is defined to be the transpose of the matrix of cofactors of A. We denote the 
adjoint of A by adj A, and thus write 

adj A = (cof A) r . 


► Example 7.26 

In Example 7.25 we found that the matrix of cofactors of the matrix 



-8 26 \ 
4 -10 . 

0 2 ) 


is the matrix 


/ 19 

cof A = -7 

\ 1 
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Thus the adjoint of A, being the transpose of cof A, is given by 

/ 19 -7 l\ 

adj A = (cof A) r = — 8 4 0. 

^ 26 -10 2/ 

We are now in a position to state the following important result giving a formula 
for the inverse of a nonsingular matrix. 

Result D. Let Abe an n x n nonsingular matrix . Then the unique inverse A “ 1 of A is 
given by the formula 

A" 1 = -j- (adj A), (7.106) 

I A | 

where | A| is the determinant of A and adj A is the adjoint matrix of A. 

Looking back over the preceding definitions, we observe that to find the inverse 
A -1 of a given nonsingular matrix A, we proceed as follows: 

(1) Replace each element a {j of A by its cofactor C y to find the matrix of cofactors 
c of A of A; 

(2) Take the transpose (interchange the rows and columns) of the matrix of A found 
in step (1) to find the adjoint matrix adj A of A; and 

(3) Divide each element of the matrix adj A found in step (2) by the determinant | A | 
of A. This gives the inverse matrix A -1 . 


► Example 7.27 

Consider again the 3 x 3 matrix 



already considered in Examples 7.24, 7.25, and 7.26. In Example 7.24 we found that 

I A | = 4 

and thus noted that since | A | # 0, A is nonsingular. Thus by Result C and the definition 
immediately following, we know that A has a unique inverse A " l . Now by Result D we 
know that A -1 is given by 

A" 1 = jL (adj A) = i(adj A). 

In Example 7.25 we found 

I 19 -8 26\ 

cof A = —7 4 — 10 ; 

1 0 2 ) . 
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and then, in Example 7.26, we obtained 


Thus we find 


adj A = (cof A) 7 = 


19 

-7 

1 

-8 

4 

0 

^ 26 

-10 

2 I 


A 1 = i(adj A) = ^ 


19 

-7 

1\ 

-8 

4 

0 

26 

-10 

2 


19 

7 

1\ 

T 

~4 

4 

-2 

1 

0 

13 

5 

1 

y 

~2 

2 


The reader should now calculate the products AA ~ 1 and A “ 1 A and observe that both 
of these products are indeed the identity matrix I. 

Just as in the special case of vectors, there are matrices whose elements are numbers 
and matrices whose elements are real functions defined on an interval [a, b\. We 
distinguish between these two types by means of the following definitions: 


DEFINITIONS 

(7) A matrix all of whose components are numbers is called a constant matrix. 

(2) A matrix all of whose components are real functions defined on an interval [ a , b] is 
called a matrix function. 


DEFINITION 


Let A defined by 



1 a u (t) 

<*12(0 • 


> 

II 

«21 (0 

^22(0 

■ a 2n (t) 


\«»i(0 

a n2(0 

■ a m (t)j 


be an n x n matrix function whose elements a £j -(f) (i = 1 , 2 ,..., n ; j = 1 , 2 ,..., n) are 
differentiable on an interval [a, b~\. Then A is said to be differentiable on [ a , b], and the 
derivative of A is the matrix function defined by 


d\(t) 

~dx~ 


l a ii(0 


«ln(0\ 


a 22( t ) 

a 2n(0 


a'„ 2 (t) 

(t'nn(t)j 


for all t 6 [ a , b]. 
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Thus the derivative of a given matrix function all of whose elements are 
differentiable is the matrix function obtained from the given matrix function by 
differentiating each element of the given matrix function! 


► Example 7.28 


The derivative of the matrix function A defined for all t by 


A(f) = 




is the matrix function defined for all t by 

d\(t) _ ( 6t 4e^\ 
dt \15f 2 6e 2t )' 


We close this section by stating the following useful result on the differentiation of a 
product of differentiable matrices. Note that the order of the factors in each product 
cannot be reversed, since matrix multiplication is not commutative. 


Result E. Let A and B be differentiable matrices on [a, b]. Then the product AB is 
differentiable on [a, b], and 

^a(W)]=^b(o + a(o® 


for all t e [a, b~\. 


Exercises 


In each of Exercises 1-10, given the matrices A and B, find the product AB. Also, find 
the product BA in each case in which it is defined. 


1. A 

2. A 






2 

- 3 \ 

14 

1 \ 

5. A = 

3 

-1 

0 , B = 

3 

2 


1 ° 

2 

*/ 

^1 

o/ 
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1 

2 


I 6 

4 

2 \ 

10 . A = 

4 

-1 

-2 

, B = 3 

-1 

- 3 


I" 1 

0 

V 

1° 

2 

-*/ 


1 1. Given a square matrix A, we define A 2 = AA, A 3 = AAA, and so forth. Using this 
definition, find A 2 and A 3 if 


A = 



12. Given the matrix 


/ 2 


A = 


3 3\ 
-2 1 
-1 o ) 


find A 2 + 3A + 21, where I is the 3 x 3 identity matrix. 

Find the inverse of the given matrix A in each of Exercises 13-24. 


13. 

15. 

17. 

19. 


A = 

A = 

A = 

A = 



I 4 3 l\ 

3 3 2 . 

I " 1 1 V 

3 4 7\ 

112 . 

\2 5 4 / 


14. 

16. 

18. 

20 . 
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21 . 


23. 

25. 


/3 




2 4 \ 
4 2 . 
2 3/ 

0 - 
2 
1 



22 . 


24. 


1 12 0 
A = 3 7 1 
\l 3 3 


-5 

-12 

1 

5 

-7 

-10 



Verify Result E for matrices A and B defined on an arbitrary real interval [a, f>] as 
follows: 



It 2 t l\ 

t 3 t 2 

t\ 

A(t) = 

It 3 1 t , B(t) = 

3 1 2 2 1 

1 


[ t t 2 t 3 ) 

(6t 2 

0/ 


C. Linear Independence and Dependence 

Before proceeding, we state without proof the following two theorems from algebra. 


THEOREM A 

A system of n homogeneous linear algebraic equations in n unknowns has a nontrivial 
solution if and only if the determinant of coefficients of the system is equal to zero. 


THEOREM B 

A system of n linear algebraic equations in n unknowns has a unique solution if and only if 
the determinant of coefficients of the system is unequal to zero. 


DEFINITION 

A set of m constant vectors v, . v 2 v m is linearly dependent if there exists a set of m 

numbers c l ,c 2 ,..-,c m , not all of which are zero, such that 

c l \ l + c 2 v 2 + ••• + c m y m = 0. 


► Example 7.29 

The set of three constant vectors 
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is linearly dependent, since there exists the set of three numbers 2, 3, and — 1, none of 
which are zero, such that 

2Vi + 3 v 2 + (— l)v 3 = 0. 


DEFINITION 


A set of m constant vectors is linearly independent if and only if the set is not linearly 
dependent. That is, a set of m constant vectors \ x , v 2 , . . . , v m is linearly independent if the 
relation 


implies that 


c 1 V 1 + c 2 v 2 + ••• + c m s m = 0 
Ci = c 2 = ••• = c w = 0. 


► Example 7.30 


The set of three constant vectors 



(l \ 

M 


o\ 

▼1 = 

1 , V 2 = 

2 

, and v 3 = 

2 


w 

l °J 


l 1 / 


is linearly independent. For suppose we have 

CiVi + c 2 v 2 + c 3 v 3 = 0 ; 


that is, 



This is equivalent to the system 

Ci ~c 2 = 0, 


c t + 2c 2 + 2c 3 = 0, 


Ci + c 3 = 0, 


(7.107) 


(7.108) 


of three homogeneous linear algebraic equations in the three unknowns c t , c 2 , c 3 . The 
determinant of coefficients of this system is 


1-10 
1 2 2 

1 0 1 


1 # 0 . 


Thus by Theorem A, with n = 3, the system (7.108) has only the trivial solution 
c x = c 2 = c 3 = 0. Thus for the three given constant vectors, the relation (7.107) implies 
Cj = c 2 = c 3 = 0; and so these three vectors are indeed linearly independent. 
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DEFINITION 

The set of m vector functions cj)^ <J> 2 , ..., <|> m is linearly dependent on an interval 
a < t < b if there exists a set of m numbers c t , c 2 , . . . , c m , not all zero, such that 

Ci<M0 + C 2$2 (0 + • • • + C m <(> m (t) = 0 

for all t e [a, b~\. 


► Example 7.31 


Consider the set of three vector functions <t> 1? <j) 2 , and ty 3 , defined for all t by 

/ e 2, \ 


M 

( 

\ 2e2, \ 

, <h(0 = 4e 2 ' 

\5e 2 ') 

\ii^ 2 7 


respectively. This set of vector functions is linearly dependent on any interval 
a < t < b. To see this, note that 

M 
0 ; 

W 

and hence there exists the set of three numbers 3,-1, and — 2, none of which are zero, 
such that 




/ e 2 '\ 

1 e 2 ‘) 

2e 2 ‘ 

+ (-l) 

4e 2 ' + (-2) 

e 2t 

5 e 2t j 


V lie 7 

\2e 2t ) 


3<M0 + (~ 1)<M0 + ( — 2)<t> 3 (r) = 0 


for all t e [ a , &]. 


DEFINITION 

A set of m vector functions is linearly independent on an interval if and only if the set is 
not linearly dependent on that interval. That is, a set of m vector functions <|> l9 <|> 2 , . . . , <|> w 
is linearly independent on an interval a < t < b if the relation 

ci<Mt) + c 2 <j> 2 (t) + • • • + c m <|> m (t) = o 

for all l e [«. b ] implies that 

Cl = c 2 = ■■■ = c m = 0. 


► Example 7.32 

Consider the set of two vector functions <)>! and <|> 2 , defined for all t by 
*,(»-(*) and ♦.<<>- (j„). 

respectively. We shall show that <t> x and cj) 2 are linearly independent on any interval 
a < t < b. To do this, we assume the contrary; that is, we assume that and <|> 2 are 
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linear dependent on [a, b~\. Then there exist numbers c x and c 2 , not both zero, such that 

C 1 + C 2 < t ) 2(0 = 0, 

for all t g [a, b]. Then 

c 1 e t + c 2 e 2t = 0, 
c 1 e t -f 2 c 2 e 2t = 0; 

and multiplying each equation through by e~\ we have 

Ci -f c 2 e l = 0, 
c t + 2 c 2 e l = 0, 

for all t g [a, b]. This implies that c x c 2 e l — c x + 2 c 2 e l and hence 1=2, which is an 
obvious contradiction. Thus the assumption that and ct> 2 are linearly dependent on 
[a, b\ is false, and so these two vector functions are linearly independent on that 
interval. 

Note. If a set of m vector functions c(> l5 4>2> •••> ls linearly dependent on an 
interval a <t < b, then it readily follows that for each fixed t 0 g [a, b], the 
corresponding set of m constant vectors 4> 1 (t 0 ), ty 2 {t 0 ), . . . , c () m (t 0 ) is linearly dependent. 
However, the analogous statement for a set of m linearly independent vector func- 
tions is not valid. That is, if a set of m vector functions c|> 1 9 c|> 2 , c|> m is linearly 

independent on an interval a < t < b, then it is not necessarily true that for each fixed 
t 0 g [a, 6], the corresponding set of m constant vectors <M r o), • • • > <M*o) 

linearly independent. Indeed, the corresponding set of constant vectors <Mt 0 )> 
• • • > fymito) ma y be linearly dependent for each t 0 G [a, b]. See Exercise 7 at the end 
of this section. 


Exercises 

1. In each case, show that the given set of constant vectors is linearly dependent. 



( 3\ 

/ 13\ 

/ 2\ 

(a) v t = 

-1 , v 2 = 

1 5 , v 3 = 

! 4 


l 2 I 

i- 4 / 

I" 5 / 


(2) 


l 2 \ 

/ l \ 

^Er 

< 

II 

4^ 

» v 2 = 

6 

» v 3 = - 1 

\>i 


i 8 i 

\- 2 / 


/ h 


A 


7 \ 

(c) v, = 2 

, v 2 = 

i 

| 

. V 3 = 

-1 

\- } l 


1 


I - 6/ 


2. In each case, show that the given set of constant vectors is linearly independent. 


(a) 
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1 


-1 

ri) 


a 

. V 3 = 

' 

-1 

3 I 


3. In each case, determine the value of k so that the given set of constant vectors is 
linearly dependent. 

I k\ I 1\ / 3\ 

(a) v, = 2 , v 2 = — 1 , v, = 71. 

\-l 2 1-8 


(b) v t = 3 , v 2 = -3 , v 3 = 3 . 


4. In each case, show that the set of vector functions <|> 2 , <t> 3 , defined for all t as 
indicated, is linearly dependent on any interval a < t < b. 


(a) <M0= 3^ , 4> 2 (0 = -5e 3 ‘ , <t> 3 (0 = \2e* . 

\ -c 3 W \ 5e 3 ' \ e 3t i 


f sin t + cos t 


(b) ^(t) = 2 sin t , c|> 2 (t) = 4 sin t — cos t , 


2 sin t 


( 4 cos t \ 

2 cost 

2 sin f — 4 cos t J 

5. In each case, show that the set of vector functions and <|> 2 defined for all t as 
indicated, is linearly independent on any interval a < t < b. 

/ \ / ^3r \ 


(a) <Mt) = 

(b) <M0 = 


and c|) 2 (t) = 


\ 

4e 3t J 


and cj) 2 (t) = 


6. Show that the set of two vector functions <t>i and <|> 2 defined for all t by 

/,\ /* 2 \ 


<M0 = ( o ) and ^(0 = 


respectively, is linearly independent on any interval a < t < b. 
7. Consider the vector functions and cj) 2 defined by 

... / 1\ , . , , ( te‘\ 


4>iW = I j I and <|> 2 (0 
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respectively. Show that the constant vectors <Mt 0 ) anc * <M*o) are linearly 
dependent for each t 0 in the interval 0 < t < 1, but that the vector functions ^ and 
<t> 2 are linearly independent on 0 < t < 1. 

8. Let 

/«i«\ 

a (,) = a . 2 ‘ (/ = 1,2,..., n), 

\ a «i / 

be a set of n linearly independent vectors. Show that 




a 


12 


«12 a 22 


/, «2 


« 1 „ 

a 2 * 


# 0 . 


D. Characteristic Values and Characteristic Vectors 

Let A be a given n x n square matrix of real numbers, and let S denote the set of all 
n x 1 column vectors of numbers. Now consider the equation 

Ax = Ax (7.109) 

in the unknown vector xeS, where A is a number. Clearly the zero vector 0 is a solution 
of this equation for every number A. We investigate the possibility of finding nonzero 
vectors x e S which are solutions of (7.109) for some choice of the number A. In other 
words, we seek numbers A corresponding to which there exist nonzero vectors x that 
satisfy (7. 109). These desired values of A and the corresponding desired nonzero vectors 
are designated in the following definitions. 


DEFINITIONS 

A characteristic value (or eigenvalue) of the matrix A is a number A for which the 
equation Ax = Ax has a nonzero vector solution x. 

A characteristic vector (or eigenvector) of A is a nonzero vector x such that Ax = Ax 
for some number A. 


► Example 7.33 

Consider the 2 x 2 square matrix 
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Observe that 



This is of the form Ax = lx, where A is the given 2x2 matrix, 1 = 4, and x = ( 2 ). Thus 
1 = 4 is a characteristic value of the given matrix A and x = ( 2 ) is a corresponding 
characteristic vector of A. 

On the other hand, we shall now show that 1 = 2 is not a characteristic value of this 
matrix A. For, if it were, then there would exist a nonzero vector x = (**) such that 

G ‘OGM;:) 

Performing the indicated multiplications on each side of this equation and then 
equating the corresponding components, we would have 


or simply 


6*! — 3x 2 = 2x 1? 
2*i + x 2 = 2 x 2 , 


4x x — 3 x 2 = 0, 


2x l — x 2 = 0. 


Since the determinant of coefficients of this homogeneous linear algebraic system is 
unequal to zero, by Theorem A the only solution of the system is the trivial solution 
*1 = *2 — That is, we must have 


x = 0, 


which is a contradiction. 

We proceed to solve the problem of determining the characteristic values and 
vectors of an n x n square matrix A. Suppose 

a ln\ 
a 2 n 

\ a nl a n2 ••• a nn ) 

is the given n x n square matrix of real numbers, and let 

j x i\ 

*- ■ 

w 


l a u «i2 ••• 

I a 21 a 22 ••• 
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Then Equation (7.109) may be written 


/<* 11 <*12 ••• <*ln^ 




l x i\ 

<*21 <*22 • • • <*2 n 


*2 

= A 

*i 

\<*wl <*n2 • • • a nn j 


w 


W 


and hence, multiplying the indicated entities, 


/an*! + a l2 x 2 + • 



/A*A 

a 2i*i + a 22 x 2 + • 

• + a 2n x„ 


Ax 2 

\ fl nl x l + a n2 x 2 + •• 

• + a nn x„j 


\>^*n/ 


Equating corresponding components of these two equal vectors, we have 
<*11*1 + <*12*2 + + <*!„*„ = >l*i, 

<* 21*1 + <* 22*2 + + <* 2 „*„ = > 1 * 2 , 


<*„1*1 + <*n2 *2 + “• + <*wn*n = 

and rewriting this, we obtain 

(<*n-^)*i + <*12*2 + ‘“ + <*!„*„ = 0, 

<* 21 * 1 + (<*22 - ^)*2 + '•• + <* 2 „*„ = 0 , 

(7.110) 

<*,.1*1 + <*m 2*2 + * * ' + (<*nn - ^)*n = 0. 

Thus we see that (7. 1 09) holds if and only if (7. 1 1 0) does. Now we are seeking nonzero 
vectors x that satisfy (7.109). Thus a nonzero vector x satisfies (7.109) if and only if its 
set of components x 1 ,x 2 ,...,x w is a nontrivial solution of (7.110). By Theorem A of 
Section 7.5C, the system (7.1 10) has nontrivial solutions if and only if its determinant of 
coefficients is equal to zero, that is, if and only if 


<*n “ A 

<*12 

<*1„ 



<*21 

<*22 — ^ 

<*2« 

= 0. 

(7.111) 

<*nl 

a n2 

* <*„„ - * 




It is easy to see that (7.1 1 1) is a polynomial equation of the nth degree in the unknown A. 
In matrix notation it is written 


| A - AI| = 0, 

where I is the n x n identity matrix (see Section 7.5B). Thus Equation (7.109) has a 
nonzero vector solution x for a certain value of A if and only if A satisfies the nth-degree 
polynomial equation (7.111). That is, the number A is a characteristic value of the 
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matrix A if and only if it satisfies this polynomial equation. We now designate this 
equation and also state the alternative definition of characteristic value that we have 
thus obtained. 


DEFINITION 

Let A = ( ciij ) be an n x n square matrix of real numbers . The characteristic equation of 
A is the nth-degree polynomial equation 


a ll — ^ 

a l2 

a ln 



a 2l 

#22 — ^ 

a 2n 

= 0 

( 7 . 111 ) 

a nl 

a n2 

a nn — X 




in the unknown X; and the characteristic values of A are the roots of this equation . 

Since the characteristic equation (7.111) of A is a polynomial equation of the nth 
degree, it has n roots. These roots may be real or complex, but of course they may or 
may not all be distinct. If a certain repeated root occurs m times, where 1 cm <n, then 
we say that that root has multiplicity m. If we count each nonrepeated root once and 
each repeated root according to its multiplicity, then we can say that the n x n matrix A 
has precisely n characteristic values, say X t , X 2 , . . . , X n . 

Corresponding to each characteristic value X k of A there is a characteristic vector 
x k (k = 1, 2,..., n). Further, if \ k is a characteristic vector of A corresponding to 
characteristic value X k , then so is cx k for any nonzero number c. We shall be concerned 
with the linear independence of the various characteristic vectors of A. Concerning 
this, we state the following two results without proof. 

Result F. Suppose each of the n characteristic values X t , X n of the n x n 

square matrix A is distinct ( that is , nonrepeated); and let x 1 ,x 2 ,...,x„ be a set of 
n respective corresponding characteristic vectors of A. Then the set of these n charac- 
teristic vectors is linearly independent. 

Result G. Suppose then x n square matrix A has a characteristic value of multiplicity 
m, where 1 < m < n. Then this repeated characteristic value having multiplicity m has p 
linearly independent characteristic vectors corresponding to it, where 1 < p < m. 

Now suppose A has at least one characteristic value of multiplicity m, where 1 < 
m < n; and further suppose that for this repeated characteristic value, the number p 
of Result G is strictly less than m; that is, p is such that 1 < p < m. Then corresponding 
to this characteristic value of multiplicity m, there are less than m linearly independent 
characteristic vectors. It follows at once that the matrix A must then have less than n 
linearly independent characteristic vectors. Thus we are led to the following result: 


Result H. If the n x n matrix A has one or more repeated characteristic values, then 
there may exist less than n linearly independent characteristic vectors of A. 



7.5 MATRICES AND VECTORS 341 


Before giving an example of finding the characteristic values and corresponding 
characteristic vectors of a matrix, we introduce a very special class of matrices whose 
characteristic values and vectors have some interesting special properties. This is the 
class of so-called real symmetric matrices, which we shall now define. 


DEFINITION 


A square matrix A of real numbers is called a 


real symmetric matrix of A T = A. 


For example, the 3 x 3 square matrix 


A = 


/-! 


-l 

o 

3 


4\ 

3 


is a real symmetric matrix, since A r = A. 

Concerning real symmetric matrices, we state without proof the following inter- 
esting results: 


Result I. All of the characteristic values of a real symmetric matrix are real numbers. 

Result J. If A is an n x n real symmetric square matrix, then there exist n linearly 
independent characteristic vectors of A, whether the n characteristic values of A are all 
distinct or whether one or more of these characteristic values is repeated. 


► Example 7.34 


Find the characteristic values and characteristic vectors of the matrix 



Solution. The characteristic equation of A is 


Evaluating the determinant in the left member, we find that this equation may be 
written in the form 

A 2 - 3A - 4 = 0 
or 

(A - 4)(A + 1) = 0. 

Thus the characteristic values of A are 


A = 4 and A = — 1. 
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The characteristic vectors x corresponding to a characteristic value X are the nonzero 
vectors 


With X = 4, this is 



such that Ax = Xx. 



Performing the indicated multiplications and equating corresponding components, we 
see that x t and x 2 must satisfy 

x t + 2x 2 = 4x 1? 

3x x + lx 2 = 4x 2 ; 

that is, 

3x x = 2x 2 , 

3x! = 2 x 2 . 


We see at once that x x = 2k, x 2 = 3k is a solution for every real k. Hence the 
characteristic vectors corresponding to the characteristic value X = 4 are the vectors 



where k is an arbitrary nonzero number. In particular, letting k = 1, we obtain the 
particular characteristic vector 



corresponding to the characteristic value X = 4. 

Proceeding in like manner, we can find the characteristic vectors corresponding to 
X = - 1. With X = - 1 and x = (*‘), the basic equation Ax = Xx is 


1 2 
3 2 



We leave it to the reader to show that we must have x 2 = — x t . Then x t = k, x 2 = — k 
is a solution for every real k. Hence the characteristic vectors corresponding to the 
characteristic value X = — 1 are the vectors 


x = 



where k is an arbitrary nonzero number. In particular, letting k = 1, we obtain the 
particular characteristic vector 



corresponding to the characteristic value X = — 1 . 



7.5 MATRICES AND VECTORS 343 


► Example 7.35 

Find the characteristic values and characteristic vectors of the matrix 

7 -1 6\ 

-10 4 -12 . 

-2 1 -lj 

Solution. The characteristic equation of A is 

7-A -1 6 

-10 4-A -12 =0. 

-2 1 -l- A 

Evaluating the determinant in the left member, we find that this equation may be 
written in the form 

A 3 - 10/l 2 + 3U — 30 = 0. 
or 

(A - 2)(A - 3 )(A - 5) = 0. 

Thus the characteristic values of A are 

A = 2, A = 3, and A = 5. 

The characteristic vectors x corresponding to a characterictic value A are the nonzero 
vectors 

(x i 

X = x 2 

\ x 3 

With A = 2, this is 

( 7 —1 6\{ x i\ j x i 

— 10 4 — 12 x 2 I = 2 1 x 2 

[ -2 1 - 1 I\ X 3/ \^3 

Performing the indicated multiplications and equating corresponding components, we 
see that x l9 x 2 , x 3 must be a nontrivial solution of the system 

7x x — x 2 + 6x 3 = 2x 1? 

— 10Xi + 4 x 2 — 12x 3 = 2x 2 , 

— 2x x + x 2 — x 3 = 2 x 3 ; 

that is, 

5xj — x 2 + 6 x 3 = 0, 

— 10x! + 2 x 2 — 12x 3 = 0, 

— 2x x + x 2 — 3 x 3 = 0. 

Note that the second of these three equations is merely a constant multiple of the first. 
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Thus we seek nonzero numbers x l9 x 2 , x 3 that satisfy the first and third of these 
equations. Writing these two as equations in the unknowns x 2 and x 3 , we have 

— x 2 + 6x 3 = — 5x t , 

x 2 — 3x 3 = 2x t . 

Solving for x 2 and x 3 , we find 

x 2 = — x t and x 3 = —x i . 

We see at once that x t = k, x 2 = — k, x 3 = —k is a solution of this for every real k. 
Hence the characteristic vectors corresponding to the characteristic value X = 2 are the 
vectors 



where k is an arbitrary nonzero number. In particular, letting k = 1, we obtain the 
particular characteristic vector 

/ 1 
-1 

l - 1 

corresponding to the characteristic value X = 2. 

Proceeding in like manner, one can find the characteristic vectors corresponding 
to X = 3 and those corresponding to X = 5. We give only a few highlights of these 
computations and leave the details to the reader. We find that the components x 1? x 2 , 
x 3 of the characteristic vectors corresponding to X = 3 must be a nontrivial solution of 
the system 

4x t — x 2 + 6x 3 = 0, 

— 10x x + x 2 — 12 x 3 = 0, 

— 2xj + x 2 — 4x 3 = 0. 

From these we find that 


x 2 = — 2x l and x 3 = — x 1? 


and hence x x = k, x 2 = —2k, x 3 = —k is a solution for every real k. Hence the 
characteristic vectors corresponding to the characteristic value X = 3 are the vectors 


x = 


\ 


k 

-2k 

-k 


where k is an arbitrary nonzero number. In particular, letting k = 1, we obtain the 
particular characteristic vector 



corresponding to the characteristic value X = 3. 
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Finally, we proceed to find the characteristic vectors corresponding to X = 5. We find 
that the components x 1? x 2 , x 3 of these vectors must be a nontrivial solution of the 
system 

2x i — x 2 + 6x 3 = 0, 

— lOXi — x 2 — 12x 3 = 0, 

— 2x { + x 2 — 6x3 = 0. 

From these we find that 


x 2 = — 2x l and 3x 3 = —2x i . 


We find that x t = 3 k, x 2 = — 6k, x 3 = —2k satisfies this for every real k. Hence the 
characteristic vectors corresponding to the characteristic value X = 5 are the vectors 


/ 3 k\ 



where k is an arbitrary nonzero number. In particular, letting k = 1, we obtain the 
particular characteristic vector 



corresponding to the characteristic value X = 5. 


Exercises 


In each of Exercises 1-14 find all the characteristic values and vectors of the matrix. 




346 SYSTEMS OF LINEAR DIFFERENTIAL EQUATIONS 



l-l 

5 5\ 


1-2 

6 

-18\ 

13. | 


4 .5 . 

14. 

12 

-23 

66 


\ 3 

-3 2/ 


\ 5 

-10 

29 J 


7.6 THE MATRIX METHOD FOR HOMOGENEOUS LINEAR SYSTEMS 
WITH CONSTANT COEFFICIENTS: TWO EQUATIONS IN 
TWO UNKNOWN FUNCTIONS 

A. Introduction 


We now return to the homogeneous linear systems of Section 7.4 and obtain solu- 
tions using matrix methods. In anticipation of more general systems, we change nota- 
tion and consider the homogeneous linear system in the form 


dx i 

—7— = o ii x l + a l2 x 2 , 


dx 2 

—fa — a 2l X l + a 22 X 2> 


(7.112) 


where the coefficients a u , a i2 , a 2l , a 22 are real constants. 

We shall now proceed to express this system in a compact manner using vectors and 
matrices. We introduce the 2 x 2 constant matrix of real numbers, 


and the vector 


f a 11 a l2\ 
\ a 21 a 22 / 



Then by definition of the derivative of a vector, 


dx 

dt 


and by multiplication of a matrix by a vector, we have 


Id xA 

~di 

dx 2 

VdT 


(7.113) 


(7.114) 


Ax _ ( a n «i 2 V*iW«ii*i + a l2 x 2 \ 

\ a 2i a 22 J\x 2 J \ a 2 i^i + a 22 x 2 J 

Comparing the components of dx/dt with the left members of (7.112) and the 
components of Ax with the right members of (7.1 12), we see that system (7.1 12) can be 
expressed as the homogeneous linear vector differential equation 
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The real constant matrix A that appears in (7. 1 1 5) and is defined by (7. 1 1 3) is called the 
coefficient matrix of (7. 1 1 5). 

We seek solutions of the system (7.112), that is, of the corresponding vector 
differential equation (7. 1 1 5). We proceed as in Section 7.4A, but now employing vector 
and matrix notation. We seek nontrivial solutions of the form 


x i = a i eAt > 

x 2 = 0L 2 e kt , 


where a 1? a 2 , and X are constants. Letting 


(7.116) 


a = 



and 



we see that the vector form of the desired solution (7.116) is x = ae kt . Thus we seek 
solutions of the vector differential equation (7.1 15) of the form 


x = a e kt , (7.117) 

where a is a constant vector and X is a number. 

Now substituting (7.117) into (7. 1 1 5) we obtain 

Xae kt = Xcne kt 


which reduces at once to 


and hence to 


Aa = X<x 


(7.118) 


(A - Xl)<x = 0, 

where I is the 2 x 2 identity matrix. Written out in terms of components, this is the 
system of two homogeneous linear algebraic equations 


(a n - X)oc 1 + a l2 a 2 = 0, 
a 2i a i + (a 22 - X)(X 2 = 0, 


(7.119) 


in the two unknowns oc l and a 2 . By Theorem A of Section 7.5C, this system has a 
nontrivial solution if and only if 


that is, in matrix notation, 


a n A 

a 2l 


a l2 

#22 X 


= 0, 


| A — All = 0. 


(7.120) 

(7.121) 


Looking back at Section 7.5D, we recognize (7.120) as the characteristic equation of 
the coefficient matrix A = (# fj ) of the vector differential equation (7. 1 1 5). Expanding the 
determinant in (7.120), we express the characteristic equation (7.120) as the quadratic 
equation 

X 2 — (# n + # 22 )^ + ( a u a 22 — a l2 a 2l ) = 0 (7.122) 


in the unknown X. We recall that the roots X t and X 2 of this equation are the 
characteristic values of A. Substituting each characteristic value X h (i = 1, 2), into 
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system (7.1 19), or the equivalent vector equation (7.1 18), we obtain the corresponding 
nontrivial solution 


a 


(0 = 



(i = 1, 2), of (7.119). We recognize the vector a (0 as the characteristic vector corre- 
sponding to the characteristic value X h ( i = 1,2). 

We thus see that if the vector differential equation 


has a solution of the form 


dx 

— - = Ax 
dt 

(7.115) 

X 

II 

« 

(7.117) 


then the number X must be a characteristic value X { of the coefficient matrix A and the 
vector a must be a characteristic vector a (l) corresponding to this characteristic value X t . 


B. Case of Two Distinct Characteristic Values 


Suppose that the two characteristic values X i and X 2 of the coefficient matrix A of the 
vector differential equation (7.115) are distinct, and let a (1) and a (2) be a pair of 
respective corresponding characteristic vectors of A. Then the two distinct vector 
functions x (1) and x (2) defined, respectively, by 

x (1) (t) = cn {l) e Xlt , x (2) (t) = cn {2) e X2t (7.123) 

are solutions of the vector differential equation (7.1 15) on every real interval [a, b~\. We 
show this for the solution x (1) as follows: From (7.1 18), we have 

A l0 t (1) = Aa (1) ; 

and using this and the definition (7.123) of x (1) (t), we obtain 

= Aa (1) e Al ' = Ax' 1 '(f), 
dt 

which states that x (1) (t) satisfies the vector differential equation (7.115) on [a, b]. 
Similarly, one shows that x (2) is a solution of (7.1 15). 

The Wronskian of solutions x (1) and x (2) is 


fF(x (1) , x (2) )(f) 


a n e Xlt 

cc l2 e x * 

a 2l e x '‘ 

a 22 eX2t 


+ x 2 )t 

a ll 

<*12 


<*21 

<*22 


By Result F of Section 7.5D, the characteristic vectors a (1) and a (2) are linearly 
independent. Therefore, using Exercise 8 at the end of Section 7.5C, we have 


|<*21 <* 2 2 | 

Then since e {Xl + Xl)t # 0 for all t, we have JF(x (1) , \ {2) )(t) # 0 for all t on [a, b~\. Thus by 
Theorem 7.4 the solutions x (1) and x (2) of (7.115) defined by (7.123) are linearly 
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independent on [a, b~\; and so a general solution is given by + c 2 x (2) where c t 
and c 2 are arbitrary constants. We summarize the results obtained in the following 
theorem. 


THEOREM 7.10 


Consider the homogeneous linear system 

dx i 

—77 ~ a ll x l + a l2 X 2> 


dx 2 

= a 2l X l + a 22 x 2i 

that is, the vector differential equation 



(7.112) 


(7.115) 


where 



and a ll ,a l2 ,a 2l ,a 22 are real constants. 

Suppose the two characteristic values X i and A 2 of A are distinct ; and let a (1) and a (2) be 
a pair of respective corresponding characteristic vectors of A. 

Then on every real interval, the vector functions defined by a (1) e Alf and <x (2) e Alt form a 
linearly independent set of solutions of (7.115); and 

x = c x n (l) e Xxt + c 2 a {2) e* 2t , 

where c t and c 2 are arbitrary constants, is a general solution of (7.115) on [ a,b\ 


► Example 7.36 


Consider the homogeneous linear system 

dx t 


dt 
dx 2 

~di 


= 6x t — 3 x 2 , 
= 2x x + x 2 , 


that is, the vector differential equation 
dx /6 — 3 


, , ^ , )x, where x = 

dt \2 1/ \x 2 

The characteristic equation of the coefficient matrix A = (f ~f) is 

|6 — A -3 


AI| = 


1 - A 


= 0 . 


(7.124) 


(7.125) 
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Expanding the determinant and simplifying, this takes the form X 2 — IX + 12 = 0 
with roots X t = 3, X 2 = 4. These are the characteristic values of A. They are distinct 
(and real), and so Theorem 7.10 applies. We thus proceed to find respective 
corresponding characteristic vectors a (1) and a (2) . We use (7.1 18) to do this. 

With X = X l = 3 and ot = a (1) = (**), (7.118) becomes 




= 3 



from which we at once find that ol 1 and a 2 must satisfy 


6a { — 3a 2 = 3a 1? oc t = a 2 , 

or 

2a i + a 2 = 3a 2 , ot i = a 2 . 

A simple nontrivial solution of this system is obviously = a 2 = 1, and thus a 
characteristic vector corresponding to X x = 3 is 



Then by Theorem 7.10, 

x = ^e 3r , that is, x = (^ e 3 r^, (7.126) 

is a solution of (7.125). 

For X = X 2 = 4 and a = a (2) = (“^), (7.118) becomes 



from which we at once find that ol x and a 2 must satisfy 

6a l — 3a 2 = 4oc { , 2a t = 3a 2 , 

or 

2oc t +a 2 = 4a 2 , 2a x = 3a 2 . 

A simple nontrivial solution of this system is obviously cc l = 3, a 2 = 2, and thus a 
characteristic vector corresponding to X 2 = 4 is 



Then by Theorem 7.10, 

x = Q e4 '’ thatis ’ x = (1A21) 

is a solution of (7.125). 

Also by Theorem 7.10 the solutions (7.126) and (7.127) of (7.125) are linearly 
independent, and a general solution is 
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where c i and c 2 are arbitrary constants. That is, in scalar language, a general solution 
of the homogeneous linear system (7.124) is 

= c t e 3t + 3 c 2 e* t , x 2 = c t e 3t + 2 c 2 e 4 ', 
where c t and c 2 are arbitrary constants. 

The system (7.124) of this example is, aside from notation, precisely the system (7.79) 
of Example 7.17. A comparison of these two illustrations provides useful insight. 

Let us return to the homogeneous linear system (7.112) and further consider the 
result stated in Theorem 7.10. In that theorem we stated that if the two characteristic 
values A x and A 2 of A are distinct and if a (1) and a (2) are two respective corresponding 
characteristic vectors of A, then the two functions defined by 

OL il) e Xlt and v. {2) e Xlt 

constitute a pair of linearly independent solutions of (7.112). Note that although we 
assume that A x and A 2 are distinct , we do not require that they be real. Thus the case of 
distinct complex characteristic values is also included here. Since the coefficients in 
system (7.112) are real, if complex characteristic values exist, then they must be a 
conjugate-complex pair. 

Suppose = a + bi and A 2 = a — bi are a pair of conjugate-complex characteristic 
values of the coefficient matrix A of system (7.112). Then the corresponding linearly 
independent solutions 

a (l ) e (n + bi)t anc j y(2) e (a-bi)t 

are complex solutions. In such a case, we may use Euler’s formula e 10 = cos 6 + i sin 6 
to replace this linearly independent complex pair of solutions by a related linearly 
independent real pair of solutions. This is accomplished exactly as explained in Section 
7.4C and is illustrated in Example 7.18. 


C. Case of a Double Characteristic Value 

Suppose that the two characteristic values A t and k 2 of the coefficient matrix A of the 
vector differential equation (7.115) are real and equal. Let A denote this common 
characteristic value, and let a be a corresponding characteristic vector. Then just as in 
the previous subsection, it is readily shown that the vector function defined by ae Xt is a 
solution of (7. 1 1 5) on every real interval [a, fc] . But now, except in the special subcase in 
which a tl = a 22 # 0, a l2 = a 2l = 0, there is only one solution of this form. Looking 
back at the results of Section 7.4D and expressing them in vector notation, we would 
now seek a second solution of the form 

x = (yr + f)e*, (7.128) 

where y and P are to be determined so that this is indeed a solution. 

We thus assume a solution of differential equation (7.115) of this form (7.128), 
differentiate, and substitute into (7.1 15). We at once obtain 

(y t + P)Ae Xt + ye Xt = A(yt + ft)e Xt . 

Then dividing through by e Xt # 0 and collecting terms in powers of t , we readily find 
(Ay - Ay)t + (Ap + y - AP) = 0. 
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This holds for all t e [a, b~\ if and only if 

Xy — Ay = 0, 
Xfi + y - Ap = 0. 


(7.129) 


The first of these gives Ay = Xy or (A — Xl)y = 0. Thus we see that y is, in fact, a 
characteristic vector a of A corresponding to characteristic value X. The second of 
(7.129) with y = a gives Ap — X$ - a, from which we have 

(A — AI)p = a (7.130) 


as the equation for the determination of P in (7.128). Thus in the assumed solution 
(7.128), the vector y is, in fact, a characteristic vector of A corresponding to 
characteristic value X , and the vector p is determined from (7.130). Direct substitution 
of (7.128) with these choices of y and p verifies that it is indeed a solution. Moreover, it 
can be shown that the two solutions thus obtained, 

ae Xt and (at + $)e Xt , 


are linearly independent; therefore, a linear combination of them constitutes a general 
solution of (7.115). We summarize these results in the following theorem. 


THEOREM 7.11 


Consider the homogeneous linear system 


dx i 

—- = a ll x l + a l2 x 2 , 


dx 2 

~ a 21 X l + a 22 X 2i 


(7.112) 


that is, the vector differential equation 

f-Ax, (7.115) 

where 



and a tl , a l2 , a 2l , a 22 are real constants which are not such that a n = a 22 # 0, 
a l2 = a 2 i = 0. Suppose the two characteristic values X l and X 2 of A are real and equal ; 
and let X denote their common value. Let a be a corresponding characteristic vector of A 
and let P be a vector satisfying the equation 

(A — AI)p = a. (7.130) 

Then on every real interval, the vector functions defined by 

ae Xt and (at + fl)e Xt (7.131) 

form a linearly independent set of solutions of (7.115); and 

x = cpze Xt + c 2 (at + f)e Xt , 


where c x and c 2 are arbitrary constants, is a general solution of (7.115) on [ a , bf 
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► Example 7.37 


Consider the homogeneous linear system 


dx t 

~di 


= 4xj — x 2 


dx 2 

~di 


= Xj + 2 x 2 , 


(7.132) 


that is, the vector differential equation 
dx (4 — 1\ 

-r- = , „ x, where x = 

dt \1 2) V a 2 

The characteristic equation of the coefficient matrix 

( 4 -1 


A = 


is 


| A — AI| = 


4- A -1 
1 2 - A 


= 0 . 


(7.133) 


Expanding the determinant and simplifying, this takes the form A 2 — 6A + 9 = 0, with 
the double root A = 3. That is, the characteristic values of A are real and equal and so 
Theorem 7.11 applies. 

Proceeding to apply it, we first find a characteristic vector a corresponding to the 
characteristic value A = 3. With A = 3 and a = (*j), (7.1 18) becomes 





from which we at once find that a t and a 2 must satisfy 


4a t — a 2 = 


a t + 2a 2 ~ 3a 2 , 


«i = «i = a 2 , 
or 

a i —a, = a 2 . 


A simple nontrivial solution of this system is obviously = a 2 = 1, and thus a 
characteristic vector corresponding to A = 3 is 



Then by Theorem 7.1 1, 

x = (j)e 3 ', that is, x = ^ 3( ^, (7.134) 

is a solution of (7.133). 

By Theorem 7.11, a linearly independent solution is of the form (a t + f>)e Xt , where 
a = (1)> ^ = 3, and p satisfies (A — AI)P = a. Thus p = ($*) satisfies 
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which quickly reduces to 



From this we at once find that ^ and p 2 must satisfy 


Pl~P 2 = l 
Pl-Pl= I- 


A simple nontrivial solution of this system is = 1, = 0. Thus we find the desired 

vector 



Then by Theorem 7.1 1, 


x = 




e 3 \ that is, 


(t + l)e 3 '\ 
te 3 ‘ y 


(7.135) 


is a solution of (7.133). 

Finally, by the same theorem, solutions (7.134) and (7.135) are linearly independent, 
and a general solution is 


X = 



+ c 2 


(t + l)e 3 '\ 
te 3 ‘ ) 


where c, and c 2 are arbitrary constants. That is, in scalar language, a general solution 
of the homogeneous linear system (7.132) is 

Xi = c^ 3 ' + c 2 (t + l)e 3 ', 

x 2 = c t e 3t + c 2 te 3 \ 


where c x and c 2 are arbitrary constants. 


Exercises 

Find the general solution of each of the homogeneous linear systems in Exercises 1-18, 
using the vector-matrix methods of this section, where in each exercise 
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9. 

11 . 

13 . 

15 . 

17 . 



10 . 

12 . 

14. 

16 . 

18 . 


dx 

It 

dx 

Hi 

dx 

dt 

dx 

It 




x 


X 


X 


X 



X 


7.7 THE MATRIX METHOD FOR HOMOGENEOUS LINEAR SYSTEMS WITH 
CONSTANT COEFFICIENTS: n EQUATIONS IN n UNKNOWN FUNCTIONS 

A. Introduction 

In this section we extend the methods of the previous section to a homogeneous linear 
system of n first-order differential equations in n unknown functions and having real 
constant coefficients. More specifically we consider a homogeneous linear system of 
the form 


dx t 

~dt 


= an*! + a 12 x 2 + • ' ' + a ln x„. 


dx 2 

~7~ = a 2l X l + a 22 X 2 + • • • + d2n X n> 


( 7 . 136 ) 


dx„ 

—J— - a nl X l + a n2 X 2 + ‘ ' ' + a nn X , i> 


where the coefficients a ;j -, (i = 1,2 ,...,n;j= 1,2 , . . ., n), are real constants. 

We proceed to express this system in vector-matrix notation. We introduce the 
n x n constant matrix of real numbers 

l a n «i2 
^ _ <l2l a 22 

^ a n\ a n2 

and the vector 


l x i\ 



«i»\ 

a 2n 

a nn I 


( 7 . 137 ) 


(7.138) 
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Then by definition of the derivative of a vector, 

dt 1 

dx 2 

~di ■ 

5 

. d*n 

U/ 

and by multiplication of a matrix by a vector, we have 


dx 

dt 



/«n 

a l2 ' 


l Xl \ 


fa nXj + a 12 x 2 + ■ 


Ax = 

1 «21 
i : 

a 22 ‘ 

" a 2 n 

x 2 1 

= 

I a 21 X! + a 22 x 2 + • 

• + a 2n x„ 


\ a nl 

a «2 ‘ 

" a nn J 

\ x "! 


\a n iXi + a„ 2 x 2 + •• 

T a nn x n j 


Comparing the components of dx/dt with the left members of (7.136) and the 
components of Ax with the right members of (7.136), we see that system (7.136) can be 
expressed as the homogeneous linear vector differential equation 



(7.139) 


The real constant matrix A that appears in (7.139) and is defined by (7.137) is called the 
coefficient matrix of (7.139). 


DEFINITION 


By a solution of the system (7.136), that is, of the vector differential equation (7.139), we 
mean ann x 1 column-vector function 

/<M 


4 > 2 

*/ 


\ 

whose components </>i , f> 2 > • • • > eac h have a continuous derivative on the real interval 

a < t < b, and which is such that 


# i(0 

dt 

#2 (0 

dt 


= an<M0 + «i2^ 2 (0 + ' ' • + aiM), 
= a 2 i4>i(t) + a 22 4> 2 (t ) + ■■■ + a 2 „4>„(t). 


d(p n (t) 


= + a n2<M0 + ••' + d m (j) n (t), 


dt 
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for all t such that a < t < b. In other words , the components <j) u 0 2 , . . . , 0„ of 0 are such 
that 


X 2 = <j> 2 (t) 


x„ = 

simultaneously satisfy all n equations of the system (7.136) identically on a < t < b. 

We proceed to introduce the concept of a general solution of system (7.136). In the 
process of doing so, we state several pertinent theorems. The proofs of these results will 
be found in Section 11.2. Our first basic result is the following. 


THEOREM 7.12 

Any linear combination of n solutions of the homogeneous linear system (7.136) is itself a 
solution of the system (7.136). 

Before going on to our next result, the reader should return to Section 7.5C and 
review the concepts of linear dependence and linear independence of vector functions. 

We now state the following basic theorem concerning sets of linearly independent 
solutions of the homogeneous linear system (7.136). 


THEOREM 7.13 

There exist sets of n linearly independent solutions of the homogeneous linear system 
(7.136). Every solution of the system (7.136) can be written as a linear combination of any 
n linearly independent solutions of (7.136). 

As a result of Theorems 7.12 and 7.13, we now give the following definition of a 
general solution for the homogeneous linear system (7.136) of n equations in n 
unknown functions. 


DEFINITION 

Let 



(<t>u\ 


/M 


1 01 n\ 

<J>1 = 

<t> 21 

,<t>2 = 

'<P22 

>•••><!>« = 

02 n 




\4>nl) 


\ <l>nn 1 


be n linearly independent solutions of the homogeneous linear system (7.136). Let 
c i,c 2 ,...,c n be n arbitrary constants. Then the solution 

X = c ! <(>! (0 + C 2 <j> 2 (t) + “* + 
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that is, 

Xi =c 1 ^ u (f) + c 2 0 12 (f) + ••• + cj ln {t), 
*2 = C t 4 > 2 l(t) + C 2 4 > 2 2 (t) + ••• + C n <t> 2 n (t), 


X„ = C t (l> nl (t) + c 2 <t>„ 2 (t) + • • • + cj nn (t), 
is called a general solution of the system (7.136). 

In order to state a useful criterion for the linear independence of n solutions of the 
system (7.136), we introduce the following concept. 


DEFINITION 


Consider the n vector functions c|) 1 , c() 2 ,.. 

. , ((>„ defined, respectively, by 

♦ i (0 = 

0 2 l (0 

,+ 2(0 = 

/ 012 ( O \ 
' 022(0 

,...,+.(0 = 

1 01.(0 \ 
02.(0 

The n x n determinant 

\ 4 > nl ( t)j 


^ 0 . 2(0 j 


^ 0 . 1 ( 0 , 



1011 012 •" 

0 i .| 



021 022 ■” 0 2n 
| 0»,1 0*2 0 ** 


(7.140] 


(7.141) 


is called the Wronskian of the n vector functions cj) 2 ,. . . , (|)„ defined by (7.140). We 
will denote it by W(<|>i,<|> 2 >-- • >0*) an d its value at t by W(ty i ,ty 2 ,. • -5<l>n)(0- 


We may now state the following useful criterion for the linear independence of n 
solutions of the homogeneous linear system (7.136). 


THEOREM 7.14 

n solutions • .,<!>„ of the homogeneous linear system (7.136) are linearly 

independent on an interval a < t < b if and only if 

^(01,02,...,0J(O/O 

for all t e [a, bf 

Concerning the values of W( cj)^ <t> 2 , ...,<!>„), we also state the following result. 


THEOREM 7.15 

Let <|> 2 , be n solutions of the homogeneous linear system (7.136) on an interval 
a < t < b. Then either W(<\> l ,ty 2 ’- • -><l>n)(0 = 0 for all t e [ a,b ] or W(ty l ,ty 2 ,--->$n)( t ) 
= 0 for no t e [a,b]. 
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Having introduced these basic concepts and results, we now seek solutions of the 
system (7.136). We shall proceed by analogy with the presentation in Section 7.6A. 
Doing this, we seek nontrivial solutions of system (7.136) of the form 


x 2 = oc 2 e x \ 
x n = cc n e Xt , 

where cl u a„, and X are constants. Letting 




1 

i x i\ 

a = 


and x = 

X 2 

i ; 




\ x "l 


we see that the vector form of the desired solution (7.142) is 


(7.142) 


(7.143) 


x = *e 


x t 


Thus we seek solutions of the vector differential equation (7.139) which are of the form 

x = *e Xt , (7.144) 

where a is a constant vector and X is a number. 

Now substituting (7.144) into (7.139), we obtain 

X*e Xt = A*e Xt 

which reduces at once to 

Aa = Aa (7.145) 


and hence to 


(A - XI)* = 0, 


where I is the n x n identity matrix. Written out in terms of components, this is the 
system of n homogeneous linear algebraic equations 

(a n - X)oc t + a 12 a 2 +-**+ a ln oc„ = 0, 

a 2i a i + ( a 22 - ^) a 2 + ”• + a 2 = 0, (7.146) 

a nl a l + a n 2<X 2 + -- - +( a nn- = 

in the n unknowns a l5 ot n . By Theorem A of Section 7.5C, this system has a 

nontrivial solution if and only if 


a n — X 

a l2 

a ln 


*21 

Cl 22 — ^ 

a 2n 

= 0; (7.147) 

a n 1 

a n2 

• a nn - X 



that is, in matrix notation, 


|A — Xl\ = 0. 
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Looking back at Section 7.5D, we recognize Equation (7.147) as the characteristic 
equation of the coefficient matrix A = (a 0 ) of the vector differential equation (7.139). 
We know that this is an nth-degree polynomial equation in A , and we recall that its 
roots X u A 2 ,...,A B are the characteristic values of A. Substituting each characteristic 
value X h (i = 1,2,..., n), into system (7.146), we obtain the corresponding nontrivial 
solution 


a i = ot u ,ot 2 = ot 2h ...,ot n = oc ni , 

(i = 1, 2, . . . , n), of system (7. 146). Since (7. 146) is merely the component form of (7. 145), 
we recognize that the vector defined by 


ot (i) = 


M 

a 2i 


\ a "7 


(i= 1,2 ,...,«), 


(7.148) 


is a characteristic vector corresponding to the characteristic value X t (i = 1, 2, . . . , n). 
We thus see that if the vector differential equation 


dx 

It 


= Ax 


(7.139) 


has a solution of the form 

x = ae Xt (7.144) 

then the number X must be a characteristic value A f of the coefficient matrix A and the 
vector a must be a characteristic vector a (0 corresponding to this characteristic value A*. 


B. Case of n Distinct Characteristic Values 

Suppose that each of the n characteristic values A l5 A 2 ,..., X n of the n x n square 
coefficient matrix A of the vector differential equation is distinct (that is, nonrepeated); 
and let a (1) , a (2) , . . . ,a (f,) be a set of n respective corresponding characteristic vectors of 
A. Then the n distinct vector functions x x , x 2 , . . . , x„ defined, respectively, by 

x (1) (t) = a (1) c Alf ,x (2) (t) = a (2) c A2t ,...,x (n) (0 = a in) e Xnt (7.149) 

are solutions of the vector differential equation (7. 1 39) on every real interval [a, fc]. This 
is readily seen as follows: F rom (7. 1 45), for each i = 1 , 2, we have 

A,a (l) = Aa (i) ; 

and using this and the definition (7.149) of x (I) (t), we obtain 

^ = Xp {i) e kit = Aa (I) e A,f = Ax (0 (t), 
dt 

which states that x (0 (r) satisfies the vector differential equation 

dx 

— = Ax, (7.139) 


on [ a , b]. 
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Now consider the Wronskian of the n solutions x (1) , x (2) , . . . , x (n) , defined by (7.149). 
We find 




a n e x " 

u. l2 e x *-" 

a ln e x "‘ 

W(x {l \ x (2) ,. 

..,X (n) )(0 = 

a 2l e x “ 

a 22 e x *--- 

«2 n e Xn ' 



a nl e x -‘ 

oc n2 e x >‘ - 

x„„e Xnt 





oc n 

a 12 


= 1 

9 (Xi + X 2 + • 

* +X n )t 

a 2l 

a 22 






<*„2 


By Result F of Section 7.5D, the n characteristic vectors a (1) , a (2) , . . . , a (n) are linearly 
independent. Therefore, using Exercise 8 at the end of Section 7.5C, we have 


Further, it is clear that 


an 

a i2 ‘ 

•• 

a 21 

a 22 

’• *2n 

a„i 

a«2 •' 

’• ot nn 


gUl + X 2 + + X n )t Q 


for all t. Thus x (2) , . . . , \ {n) )(t) / 0 for all t on [a, &]. Hence by Theorem 7. 14, the 

solutions x (1) , x (2) , . . . , x (w) , of vector differential equation (7.139) defined by (7.149), are 
linearly independent on [a, b]. Thus a general solution of (7.139) is given by 

c r x (1) + c 2 x (2) + • • • + c n \ {n) 

where c i9 c 2 . . . , c n are n arbitrary numbers. We summarize the results obtained in the 
following theorem: 


THEOREM 7.16 


Consider the homogeneous linear system 


dx t 

~dt 


= a li x l + a l2 x 2 + ••• + a ln x„, 


dx 2 

~di 


— ^21^1 "b @ 22 X 2 "b 


T ^2n X n^ 


(7.136) 


dX n 

dt 


~ a nl X l + a n2 X 2 + *** + a nn X n 


that is, the vector differential equation 
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where 



l a il 

a l2 

■■ flln\ 


l x l\ 

A = 

a 2 1 

a 22 

’• “in 

X = 

1 x 2 


\a„i 

“ n2 " 

'■ a nn j 


[*■! 


and the a {j , (i = 1 , 2 , ...,n;j = 1 , 2 , . . . , n), are real constants . 

Suppose each of the n characteristic values X i ,X 2 ,...,X n of A is distinct ; and to 
a (1) , a (2) , . . . , <x (n) be a set of n respective corresponding characteristic vectors of A. 

Then on every real interval , the n vector functions defined by 

a ( 1 ) e Al *, y. {2) e Xlt , ..., a (n) e Xnt 

form a linearly independent set of solutions of (7.136), that is, (7.139); and 
x = c x OL (l) e Xit + c 2 v. {2) e X2t + ••• + c n ai {n) e Xnt , 
where c { ,c 2 ,...,c n are n arbitrary constants, is a general solution of (7.136). 


► Example 7.38 

Consider the homogeneous linear system 

dx { 


dt 

dx 2 
dt 

dx 3 
dt 


= lx Y — x 2 + 6 x 3 , 

= — lOXi + 4 x 2 — 12x 3 , 
= —2x l + x 2 — ^ 3 , 


that is, the vector differential equation 


dx 

It 


Assuming a solution of (7.151) of the form 
that is, 


7 

-10 

-1 

4 

-12 

X, 

where x = 

i x i\ 
x 2 

I " 2 

1 




\ x 3j 


x = ae Xt , 


(7.150) 


(7.151) 


Xi = a t e Xt , x 2 = a 2 e Xt , x 3 = a 3 e Xt , 


we know that X must be a solution of the characteristic equation of the coefficient 
matrix 
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This characteristic equation is 


|A — AI| = 


7 — A -1 6 


-10 4- A 


-12 


= 0. 


-2 1 — 1 — A 


Expanding the determinant and simplifying, we see that the characteristic equation is 

A 3 - 10A 2 + 31A - 30 = 0, 

the factored form of which is 


(A - 2)(A - 3)(A - 5) = 0. 
We thus see that the characteristic values of A are 


A, = 2, A 2 = 3, and A 3 = 5. 


These are distinct (and real), and so Theorem 7.16 applies. We thus proceed to find 
characteristic vectors a (1) , a <2) , a <3) corresponding respectively to A t . A 2 . A 3 . We use the 
defining equation 

Aa = Aa (7.145) 

to do this. 

For A = A x = 2 and 


a = a (1) = 


defining equation (7.145) becomes 



/<* i\ 
a 2 

W 

( a 
a 2 

W 


= 2 


Performing the indicated multiplications and equating corresponding components of 
this, we find 


70^ — a 2 + 6a 3 = 2a! , 


— lOaj + 4a 2 — 12a 3 = 2a 2 , 


— 2a ! + a 2 — a 3 = 2a 3 . 

Simplifying, we find that ai, a 2 , a 3 must satisfy 

5a! — a 2 + 6a 3 = 0, 

— 10a! + 2a 2 — 12a 3 = 0, (7.152) 

— 2a! + a 2 ” 3a 3 = 0, 

The second of these three equations is merely a constant multiple of the first. Thus we 
seek nonzero numbers a l5 a 2 , a 3 that satisfy the first and third of these equations. 
Writing these two as equations in the unknowns a 2 and a 3 , we have 

— a 2 + 6a 3 = — 5a!, 

a 2 — 3a 3 = 2a!. 
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Solving for a 2 and a 3 , we find 

a 2 = — oc l and a 3 = — oq. 

A simple nontrivial solution of this is a! = 1, a 2 = — 1, a 3 = — 1. That is, = 1, a 2 = 
— 1, a 3 = — 1 is a simple nontrivial solution of the system (7.152). Thus a characteristic 
vector corresponding to X t = 2 is 

/ i\ 


Then by Theorem 7.16, 


"" 1 :: 


= —l\e 2t , that is, I —e 


(7.153) 


is a solution of (7.151). 
For X = X 7 = 3 and 


6 M * 

12 aJ = 3 a 


(7.145) becomes 


\ "2 1 -vW W 

Performing the indicated multiplications and equating corresponding components of 
this and then simplifying, we find that a l5 a 2 , a 3 must satisfy 

4ai — a 2 + 6a 3 = 0, 

— lOoq + a 2 — 12a 3 = 0, 

— 2a ! + a 2 — 4a 3 = 0. 


From these we find that 


a 2 = — 2a ! and a 3 = — a x . 


A simple nontrivial solution of this is a x = l,a 2 = -2, a 3 = — 1. Thus a characteristic 
vector corresponding to X 2 = 3 is 




Then by Theorem 7.16, 


x = —2 e 3t , that is, —2e 3t 
i-l/ 


(7.154) 


is a solution of (7.151). 
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For X = X 3 = 5 and 


(7.145) becomes 


a = a (3) = 


7 

-1 

6 \ 

a i\ 

^l\ 

-10 

4 

-12 

a 2 = 5 

a 2 

l- 2 

1 

- 1 / 

w 

r 3 / 


Performing the indicated multiplications and equating corresponding components of 
this, and then simplifying, we find that a 1? a 2 , a 3 must satisfy 


2a t — a 2 + 6a 3 = 0, 
— 10a! — a 2 — 12a 3 = 0, 
— 2a! + a 2 — 6a 3 = 0. 

From these we find that 


a 2 = — 2a ! and 3a 3 = —2a!. 


A simple nontrivial solution of this is ai = 3, a 2 = — 6, a 3 = — 2. Thus a characteristic 
vector corresponding to X 3 = 5 is 


Then by Theorem 7.16, 

x 


I 3 \ 

-6 

\- 2 / 


/ 3\ 

/ 3e 5 '\ 

1 

ON 

e 5 ', that is, — 6e 5 ' 

\- 2 i 

\- 2 ‘i 


(7.155) 


is a solution of (7.151). 

Also by Theorem 7.16, the solutions (7.153), (7.154), and (7.155) are linearly inde- 
pendent, and a general solution is 



e 2 '\ 




3e 5, \ 

\ = c 1 

-e 2 ' 

+ c 2 

— 2e 3 ' 

l -'*) 

+ c 3 

— 6e 5 ' 

i- 2 ^ 5 7 


where c t , c 2 , and c 3 are arbitrary constants. That is, in scalar language, a general 
solution of the homogeneous linear system (7.150) is 


x i = c i e2t + c 2 e3t + 3c 3 e 5f , 
x 2 = — c t e 2t — 2 c 2 e 3t — 6 c 3 e 5t , 
x 3 = —c t e 2t — c 2 e 3t — 2 c 3 e 5t , 


where c 1? c 2 , and c 3 are arbitrary constants. 
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We return to the homogeneous linear system (7.136), that is, the vector differential 
equation 

= Ax, (7.139) 

where A is an n x n real constant matrix, and reconsider the result stated in Theorem 
7.16. In that theorem we stated that if each of the n characteristic values X l9 X 2 , . . . 9 X n of 
A is distinct and if a C. * * * * * (1) , a (2) , . . . , a (n) is a set of n respective corresponding characteristic 
vectors of A, then the n functions defined by 

form a fundamental set of solutions of (7.139). Note that although we assume that X l9 
X l9 ... 9 X n are distinct , we do not require that they be real Thus distinct complex 
characteristic values may be present. However, since A is a real matrix, any complex 
characteristic values must occur in conjugate pairs. Suppose X t = a + bi and X 2 = 
a — bi form such a pair. Then the corresponding solutions are 

a (1 V fl+W)r and ^ 2) e {a ~ bi \ 

and these solutions are complex solutions. Thus if one or more distinct conjugate- 
complex pairs of characteristic values occur, the fundamental set defined by cn (i) e Xit , 
i = 1, 2, . . . , n, contains complex functions. However, in such a case, this fundamental 
set may be replaced by another fundamental set, all of whose members are real func- 
tions. This is accomplished exactly as explained in Section 7.4C and illustrated in Ex- 
ample 7.18. 


C. Case of Repeated Characteristic Values 


We again consider the vector differential equation 



(7.139) 


where A is an n x n real constant matrix; but here we give an introduction to the case in 
which A has a repeated characteristic value. To be definite, we suppose that A has a real 
characteristic value X Y of multiplicity m, where 1 < m < n, and that all the other 
characteristic values X m+i9 X m + 29 ... 9 X n (if there are any) are distinct. By result G of 
Section 7.5D, we know that the repeated characteristic value X l of multiplicity m has p 
linearly independent characteristic vectors, where 1 < p < m. Now consider two 
subcases: (1) p = m; and (2) p < m. 

In Subcase (1), p = m, there are m linearly independent characteristic vectors a (1) , 
a (2) , . . . , a (m) corresponding to the characteristic value X l of multiplicity m. Then the n 
functions defined by 


+ if ft (n) e Xnt 


form a linearly independent set of n solutions of differential equation (7.139); and a 
general solution of (7.139) is a linear combination of these n solutions having n 
arbitrary numbers as the “constants of combination.” 
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► Example 7.39 

Consider the homogeneous linear system 

dx t 


dt 

dx 2 

~dt 


= 3Xi + x 2 — x 3 


= x t + 3x 2 — x 3 , 


dx 3 

— = 3x t + 3 x 2 - x 3 , 


or in matrix form, 


dx 

~dt 


Assuming a solution of the form 
that is, 


I 3 

1 

-1) 


(x^ 

1 

3 

-1 

x, where x = 

x 2 

l 3 

3 



\ X V 


x = ace At , 


(7.156) 


(7.157) 


x t = a l e Xt , x 2 = oc 2 e Xt , x 3 = a 3 e A ', 

we know that X must be a solution of the characteristic equation of the coefficient 
matrix 



This characteristic equation is 


|A — AI| = 


3-/1 1 -1 

1 3-A -1 

3 3 -1-/1 


= 0 . 


Expanding the determinant and simplifying, we see that the characteristic equation is 

/t 3 - 5/L 2 + 8/1 - 4 = 0, 

the factored form of which is 

{X - 1)(X - 2){X - 2) = 0. 

We thus see that the characteristic values of A are 

X t = 1, X 2 = 2 and X 3 = 2. 

Note that whereas the number 1 is a distinct characteristic value of A, the number 2 is a 
repeated characteristic value. We again use 

Aa = Xai 


to find characteristic vectors corresponding to these characteristic values. 


(7.145) 
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For A = 1, and 



(7.145) becomes 


3 1 -l\ 



/«i\ 

13-1 

«2 

= 1 

a 2 

\ 3 3 - 1 / 

w 




Performing the indicated multiplications and equating corresponding components of 
this and then simplifying, we find that a l5 a 2 , a 3 must be a nontrivial solution of the 
system 

2a t + a 2 — a 3 = 0, 

+ 2a 2 — a 3 = 0, 

3a! + 3a 2 — 2a 3 = 0. 


One readily sees that such a solution is given by 


= 1, oc 2 = 1, a 3 = 3. 


Thus a characteristic vector corresponding to = 1 is 


Then 



(7.158) 


is a solution of (7.157). 

We now turn to the repeated characteristic value X 2 = A 3 = 2. In terms of the 
discussion just preceding this example, this characteristic value 2 has multiplicity 
m — 2 < 3 = n, where n — 3 is the common number of rows and columns of the 
coefficient matrix A. For X — 2 and 


(7.145) becomes 




\i« l \ 


(<*i\ 

* 

<N 

II 

a 2 

i\« 3 ) 




(7.159) 


Equating corresponding coefficients of this and simplifying, we find that , a 2 , a 3 must 
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be a nontrivial solution of the system 

+ oc 2 — a 3 = 0, 

<*1 + <*2 - <*3 = 0 , 

3a t + 3a 2 — 3a 3 = 0. 

Note that each of these three relations is equivalent to each of the other two, and so the 
only relationship among a 1? a 2 , a 3 is that given most simply by 


Observe that 


a l + a 2 ~~ a 3 = 0- 


1, a 2 = -1, a 3 


(7.160) 


= 1, a 2 = 0, a 3 = 1 

are two distinct solutions of this relation (7. 1 60). The corresponding vectors of the form 
(7.159) are thus 

a (2) = (— 1 \ and | a (3) = (oV 


respectively. Since each satisfies (7.145) with 1 = 2, each is a characteristic vector 
corresponding to the double root 1 2 = 1 3 = 2. Furthermore, using the definition of 
linear independence of a set of constant vectors, one sees that these vectors a (2) and a (3) 
are linearly independent. Thus the characteristic value X = 2 of multiplicity m = 2 has 
the p = 2 linearly independent characteristic vectors 


a (2) = — 1 and a (3) = 0 


corresponding to it. Hence this is an illustration of Subcase (1) of the discussion 
preceding this example. Thus, corresponding to the twofold characteristic value 2 = 2, 
there are two linearly independent solutions of system (7.157) of the form ae Xt . These 


that is, 


v. {2) e 2t and a (3) e 2f , 

i i \ i'\ 

— lie 2 ' and 0 e 2t , 

\ 0 1 


-e 2 ‘ and 0 , 

0 \e 2, i 


(7.161) 


respectively. 
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The three solutions 


/ e '^ 


e 2t \ 


(e 2 '\ 

e ‘\ 

5 

2 1 1 

, and J 

° 

w 


0 1 




given by {7.158) and (7.161) are linearly independent, and a general solution is 



M 


1 e2 ‘\ 


(e 2 <\ 

X = c x 

e' 

[3e'J 

+ C 2 

— e 2 ‘ 

\ 0 ) 

+ CsJ 

Iw 


where c l 9 c 2 , and c 3 are arbitrary constants. That is, in scalar language, a general 
solution of the homogeneous linear system (7.156) is 

*i = c v e l + (c 2 + c 3 )e 2 ’, 

x 2 = c t e' - c 2 e 2 ’, 

x 3 = 3 c y e' + c 3 e 2t . 

where c l9 c 2 , and c 3 are arbitrary numbers. 

One type of vector differential equation (7.139) which always leads to Subcase (1), 
p = m, in the case of a repeated characteristic value X x is that in which the n x n 
coefficient matrix A of (7.139) is a real symmetric matrix. For then, by Result J of 
Section 7.5D, there always exist n linearly independent characteristic vectors of A, 
regardless of whether the n characteristic values of A are all distinct or not. 

We now turn to a consideration of Subcase (2), p < m. In this case, there are less than 
m linearly independent characteristic vectors a (1) corresponding to the characteristic 
value X x of multiplicity m. Hence there are less than m linearly independent solutions of 
system (7.136) of the form a (1) e Al ' corresponding to X x . Thus there is not a full set of n 
linearly independent solutions of (7. 1 36) of the basic exponential form a (k) e Xkt , where X k 
is a characteristic value of A and a (k) is a characteristic vector corresponding to X k . 
Clearly we must seek linearly independent solutions of another form. 

To discover what other forms of solution to seek, we first look back at the analogous 
situation in Section 7.6C. The results there suggest the following: 

Let X be a characteristic value of multiplicity m = 2. Suppose p = 1 < m, so that 
there is only one type of characteristic vector a and hence only one type of solution of 
the basic exponential form ae Xt corresponding to X. Then a linearly independent 
solution is of the form 

(at + $)e M , 

where a is a characteristic vector corresponding to X , that is, a satisfies 

(A - Xl)a = 0; 

and p is a vector which satisfies the equation 

(A - AI)P = a. 

Now let X be a characteristic value of multiplicity m — 3 and suppose p < m. Here 
there are two possibilities: p = 1 and p = 2. 
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If p = 1, there is only one type of characteristic vector a and hence only one type of 
solution of the form 


*e Xt (7.162) 

corresponding to X. Then a second solution corresponding to X is of the form 

(at + $)e Xt 9 (7.163) 

where a is a characteristic value corresponding to X , that is, a satisfies 

(A - XI)* = 0; (7.164) 

and ft is a vector which satisfies the equation 

(A — AI)p = a. (7.165) 

In this case, a third solution corresponding to X is of the form 

+ + (7.166) 

where a satisfies (7.164), p satisfies (7.165), and y satisfies 

(A — Xl)y =p. (7.167) 

The three solutions (7.162), (7.163), and (7.166) so found are linearly independent. 

If p = 2, there are two linearly independent characteristic vectors a (1) and a (2) 
corresponding to 1 and hence there are two linearly independent solutions of the 
form 


* (1) e Xt and * {2) e Xt . 

(7.168) 

Then a third solution corresponding to X is of the form 


(at + V)e xt , 

(7.169) 

where a satisfies 

(A - XI)* = 0, 

and P satisfies 

(7.170) 

(A - AI)P = a. 

(7.171) 

Now we must be careful here. Let us explain: Since a (1) and a (2) are both characteris- 
tic vectors corresponding to A, both * = a (1) and * = a (2) satisfy (7.170). However, in 
general, neither of these values of * will be such that the resulting equation (7.171) in p 
will have a nontrivial solution for p. Thus, instead of using the simple solutions a (1) or 
a (2) of (7.170), a more general solution of that equation is needed. Such a solution is 
provided by 

a = k t « (1) + k 2 <x <2) . 

(7.172) 


where and k 2 are suitable constants. We now substitute (7.172) for a in (7.171) and 
determine k t and k 2 so that the resulting equation in p will have a nontrivial solution for 
p. With these values chosen for k t and k 2 , we thus have the required a and now find the 
desired nontrivial p. The three resulting solutions (7.168) and (7.169) thus determined 
are linearly independent. We illustrate this situation in the following example. 
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► Example 7.40 

Consider the homogeneous linear system 

dx t 


dt 

dx 2 

~di 


= 4*! + 3*2 + x 3 , 

= -4x t — 4x 2 - 2x 3 , 


(7.173) 


— — — 8x t 12 x 2 “h 6x3, 
dt 


or in matrix form, 


dx 

~di 


Assuming a solution of the form 
that is, 


/ 4 

3 




r 4 

-4 

-2 

x, where x = 

x 2 \ 

l 8 

12 

6 ) 


\ X 3 1 


(7.174) 


x = ae Ar , 


= ccy, x 2 = a 2 e xt , x 3 = a 3 e x ', 

we know that X must be a solution of the characteristic equation of the coefficient 
matrix 

1 4 5 >\ 

A = I —4 -4 -2. 

8 12 6 / 


This characteristic equation is 


| A — AI | = 


\ 


4 — A 3 1 

_4 —4 — X -2 

8 12 6- A 


= 0 . 


Expanding the determinant and simplifying, we see that the characteristic equation is 

A 3 - 6A 2 + 12A - 8 = 0, 

the factored form of which is (A — 2) 3 = 0. We thus see that the characteristic values of 
A are 


Ai - A 2 - A 3 - 2. 

That is, the number 2 is a triple characteristic value of A. We again use 

Aa = Aa 

to find the corresponding characteristic vector(s) a. 

With A = 2 and 


(7.145) 


a = 


/« i\ 

a 2 

w 


(7.175) 



7.7 THE MATRIX METHOD FOR HOMOGENEOUS LINEAR SYSTEMS 373 


(7.145) becomes 

/ 4 3 l\/aA («i 

-4 -4 -2 a 2 =2 a 2 

\ 8 12 6 /\ a 3 / \« 3 

Performing the indicated multiplications and equating corresponding components of 
this and then simplifying, we find that a l5 a 2 , a 3 must be a nontrivial solution of the 
system 

2a x 4- 3a 2 -I- a 3 = 0, 

— 4a t — 6a 2 — 2a 3 = 0, 

8ai + 12a 2 + 4a 3 = 0. 

Each of these three relationships is equivalent to each of the other two, and so the only 
relationship among a 1? a 2 , a 3 is that given most simply by 


Observe that 


2a ! + 3a 2 + a 3 = 0. 


(7.176) 


and 


a l = 1, a 2 = 0, a 3 = -2 
a t = 0, a 2 = 1, a 3 = -3 


are two distinct solutions of relation (7.176). The corresponding vectors of the form 
(7.175) are thus 



respectively. Since each satisfies (7.145) with X = 2, each is a characteristic vector 
corresponding to the triple characteristic value 2. Furthermore, it is easy to see that the 
two characteristic vectors a (1) and a (2) are linearly independent, whereas every set of 
three characteristic vectors corresponding to characteristic value 2 are linearly 
dependent. Thus the characteristic value X = 2 of multiplicity m = 3 has the p = 2 
linearly independent characteristic vectors 



(7.177) 


corresponding to it. Hence this is an illustration of the situation described in the 
paragraph immediately preceding this example. Thus corresponding to the triple 
characteristic value X = 2 there are two linearly independent solutions of system (7. 173) 
of the form ae Xt . These are a {l) e 2t and a (2) e 2r , that is, 
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( 

0 

\ — 2e 2t 


e 

— 3e 2t \ 


respectively. 

A third solution corresponding to X = 2 is of the form 


where a satisfies 


and p satisfies 


(at + P)e 2 ', 
(A - 2I)a = 0 
(A - 2I)P = a. 


(7.178) 

(7.179) 

(7.180) 

(7.181) 


Since both a ( X) and a <2) given by (7. 1 77) are characteristic vectors of A corresponding to 
2 = 2, they both satisfy (7.180). But, as noted in the paragraph immediately preceding 
this example, we need to use the more general solution 

a = /c, a (1) + /c 2 a (2) 

of (7.180) in order to obtain a nontrivial solution for f) in (7.181). 

Thus we let 



I *i 1 


(PA 

a = fe l a <1) + k 2 a (2) = 

k 2 

and |i = 

P 2 


{ — 2k 1 — 3k 2 j 


\N 


and then (7.181) becomes 


\(PA 


ki \ 

\\p 2 

= 

k 2 

!\n 


rT" 

1 

1 


Performing the indicated multiplications and equating corresponding components of 
this, we obtain 


2/?i + 3/? 2 + P 3 - k t , 

—4/?! — 6/? 2 — 2/? 3 = k 2 , 

8 pi + 12/?2 + 4/? 3 = -2k t -3k 2 . 


(7.182) 


Observe that the left members of these three relations are all proportional to one 
another. Using any two of the relations, we find that k 2 = —2k t . A simple nontrivial 
solution of this last relation is k y = 1, k 2 = — 2. With this choice of fcj and k 2 , we find 


« = -2 ; 

4 


(7.183) 
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and the relations (7.182) become 

20i + 30 2 + 0 3 = 1, 

-40,- 60 2 - 203 = - 2 , 
80i + 1202 + 403 = 4. 

Each of these is equivalent to 

201 + 302 + 03 = 1 . 

A nontrivial solution of this is 

0i = 02 = O, 03 = 1; 


and thus we obtain 


/ 0\ 


p = 


0 

w 


(7.184) 

Therefore, with a given by (7.183) and p given by (7.184), the third solution (7.179) is 


that is, 


[/ 


o\" 

-2 

t + 

0 

l *) 


VI. 


e 2 ', 


i<r 


— 2te 2 ' 

l(4t + l)e 2 ' I 


(7.185) 


The three solutions defined by (7.178) and (7.185) are linearly independent, and a 
general solution is the linear combination 



( e 2 '\ 

( o ^ 


/ te 2t \ 

Cl 

0 + c 2 

e 2 ' 

+ C 3 

— 2 te 2t 


\-2 e2 'l 

\ — 2>e 2, j 


^(4t + \)e 2t J 


of these three, where c l ,c 2 , c 3 are arbitrary constants. That is, in component form, a 
general solution of system (7.173) is 


x i = c i e2t + c 3 te 2t , 


x 2 = c 2 e 2t — 2 c 3 te 2t , 
x 3 = —2 c v e 2t — 3 c 2 e 2t + c 3 (4t + l)e 2t , 
where c u c 2 , c 3 are arbitrary constants. 


Exercises 

Find the general solution of each of the homogeneous linear systems in Exercises 1-24, 
where in each exercise 
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1. 


3. 


5. 


7. 


9. 


11 . 


13. 


15. 


17. 


19. 


21 . 


23. 


dx 

T 

dx 

T 

dx 

dt 

dx 

It 

dx 

It 

dx 

li 

dx 

T 

dx 

li 

dx 

dt 

dx 

Tt 

dx 

dt 

dx 

T 


fl 

1 

— 



2 

3 

- 

4 

X. 

l 4 

1 

- 

4 i 



1 

-1 


— 


1 

3 



\- 

3 

1 


- 

1 

2 




2 

0 

3 

X. 

\ 2 

3 

°i 




1 

— 2 


0^ 

- 

■2 

3 

0 


0 

0 

2 l 


7 

0 


4 \ 


8 

3 


8 

\“ 

-8 

0 

- 

- 5 ) 


X 

(: 

( 

( 

( 



„ dx 

1 T' 


x. 


4. 

dt 


6 . 


dx 

T 


8 . T 

dt 


10 . 


12 . 


14. 


16. 


18. 


20 . 


22 . 


dx 

T 

dx 

T 

dx 

T 

dx 

T 

dx 

T 

dx 

T 

dx 

T 


7 X. 


2 l 


1 \ 


24. 

dt 


• 1 \ 
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CHAPTER EIGHT 

Approximate Methods of Solving First-Order Equations 


In Chapter 2 we considered certain special types of first-order differential equations 
having closed-form solutions that can be obtained exactly. For a first-order differential 
equation that is not of one or another of these special types, it usually is not apparent 
how one should proceed in an attempt to obtain a solution exactly. Indeed, in most 
such cases the discovery of an exact closed-form solution in terms of elementary 
functions would be an unexpected luxury! Therefore, one considers the possibilities of 
obtaining approximate solutions of first-order differential equations. In this chapter we 
shall introduce several approximate methods. In the study of each method in this 
chapter our primary concern will be to obtain familiarity with the procedure itself and 
to develop skill in applying it. In general we shall not be concerned here with theoretical 
justifications and extended discussions of accuracy and error. We shall leave such 
matters, important as they are, to more specialized and advanced treatises and instead 
shall concentrate on the formal details of the various procedures. 


8.1 GRAPHICAL METHODS 

A. Line Elements and Direction Fields 

In Chapter 1 we considered briefly the geometric significance of the first-order 
differential equation 

£-/<*,)’), ( 8 . 1 ) 

where / is a real function of x and y. The explicit solutions of (8.1) are certain real 
functions, and the graphs of these solution functions are curves in the xy plane called 
the integral curves of (8.1). At each point (x, y) at which /(x, y) is defined, the differential 
equation (8.1) defines the slope /(x, y) at the point (x, y) of the integral curve of (8.1) 
that passes through this point. Thus we may construct the tangent to an integral curve 
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of (8.1) at a given point (x, y) without actually knowing the solution function of which 
this integral curve is the graph. 

We proceed to do this. Through the point (x, y) we draw a short segment of the 
tangent to the integral curve of (8.1) that passes through this point. That is, through 
(x, y) we construct a short segment the slope of which is /(x, y), as given by the 
differential equation (8.1). Such a segment is called a line element of the differential 
equation (8.1). 

For example, let us consider the differential equation 

Tx = lx + y - (8 ' 2) 

Here /(x, y) = 2x + y, and the slope of the integral curve of (8.2) that passes through 
the point (1,2) has at this point the value 

/( 1,2) = 4. 

Thus through the point (1, 2) we construct a short segment of slope 4 or, in other words, 
of angle of inclination approximately 76° (see Figure 8.1). This short segment is the line 
element of the differential equation (8.2) at the point (1,2). It is tangent to the integral 
curve of (8.2) which passes through this point. 

Let us now return to the general equation (8.1). A line element of (8.1) can be 
constructed at every point (x, y) at which /(x, y) in (8.1) is defined. Doing so for a 
selection of different points (x, y) leads to a configuration of selected line elements that 
indicates the directions of the integral curves at the various selected points. We shall 
refer to such a configuration as a line element configuration. 

For each point (x, y) at which /(x, y) is defined, the differential equation (8.1) thus 
defines a line segment with slope /(x, y), or, in other words, a direction. Each such 
point, taken together with the corresponding direction so defined, constitutes the so- 
called direction field of the differential equation (8.1). We say that the differential 
equation (8.1) defines this direction field, and this direction field is represented 
graphically by a line element configuration. Clearly a more thorough and carefully 
constructed line element configuration gives a more accurate graphical representation 
of the direction field. 



Figure 8.1 Line element of differential equation (8.2) at point (1, 2). 
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For a given differential equation of the form (8.1), let us assume that a “thorough and 
carefully constructed” line element configuration has been drawn. That is, we assume 
that line elements have been carefully constructed at a relatively large number of 
carefully chosen points. Then this resulting line element configuration will indicate the 
presence of a family of curves tangent to the various line elements constructed at the 
different points. This indicated family of curves is approximately the family of integral 
curves of the given differential equation. Actual smooth curves drawn tangent to the 
line elements as the configuration indicates will thus provide approximate graphs of 
the true integral curves. 

Thus the construction of the line element configuration provides a procedure for 
approximately obtaining the solution of the differential equation in graphical form. We 
now summarize this basic graphical procedure and illustrate it with a simple example. 

Summary of Basic Graphical Procedure 

1. Carefully construct a line element configuration, proceeding until the family of 
“approximate integral curves” begins to appear. 

2. Draw smooth curves as indicated by the configuration constructed in Step 1. 

► Example 8.1 

Construct a line element configuration for the differential equation 

^ = 2 * + y, (8.2) 

and use this configuration to sketch the approximate integral curves. 

Solution. The slope of the exact integral curve of (8.2) at any point (x, y) is given by 

/(x, y) = 2x + y. 

We evaluate this slope at a number of selected points and so determine the 
approximate inclination of the corresponding line element at each point selected. We 
then construct the line elements so determined. From the resulting configuration we 
sketch several of the approximate integral curves. A few typical inclinations are listed 
in Table 8.1 and the completed configuration with the approximate integral curves 
appears in Figure 8.2. 


TABLE 8.1 


X 

y 

dy/dx 

(Slope) 

Approximate inclination 
of line element 

1 

2 

i 

~2 

1 

2 

IT 

1 

2 

0 

l 

45° 

1 

2 

1 

2 

3 

2 

56° 

1 

2 

1 

2 

63° 

1 

1 

~ 2 

3 

2 

56° 

1 

0 

2 

63° 

1 

1 

2 

1 

2 

68° 

1 

1 

3 

72° 
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Comments. The basic graphical procedure outlined here is very general since it can 
be applied to any first-order differential equation of the form (8.1). However, the 
method has several obvious disadvantages. For one thing, although it provides the 
approximate graphs of the integral curves, it does not furnish analytic expressions for 
the solutions, either exactly or approximately. Furthermore, it is extremely tedious and 
time-consuming. Finally, the graphs obtained are only approximations to the graphs 
of the exact integral curves, and the accuracy of these approximate graphs is uncertain. 
Of course, apparently better approximations can be obtained by constructing more 
complete and careful line element configurations, but this in turn increases the time and 
labor involved. We shall now consider a procedure by which the process may be 
speeded up considerably. This is the so-called method of isoclines. 


B. The Method of Isoclines 
DEFINITION 

Consider the differential equation 

% ~ nx ' y) - ,8i) 
A curve along which the slope f(x , y) has a constant value c is called an isocline of the 
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differential equation (8.1). That is, the isoclines of (8.1) are the curves f(x, y) = c, for 
different values of the parameter c. 

For example, the isoclines of the differential equation 

dv 

— = 2 x + y (8.2) 

dx 

are the straight lines 2x + y = c. These are of course the straight lines y = — 2x + c of 
slope — 2 and ^-intercept c. 

Caution. Note carefully that the isoclines of the differential equation (8.1) are not in 
general integral curves of (8.1). An isocline is merely a curve along which all of the line 
elements have a single, fixed inclination. This is precisely why isoclines are useful. Since 
the line elements along a given isocline all have the same inclination, a great number of 
line elements can be constructed with ease and speed, once the given isocline is drawn 
and one line element has been constructed upon it. This is exactly the procedure that we 
shall now outline. 


Method of Isoclines Procedure 

1. From the differential equation 

d £ = f(x, y) (8.1) 

determine the family of isoclines 

f(x, y) = c , (8.3) 

and carefully construct several members of this family. 

2. Consider a particular isocline f(x , y) = c 0 of the family (8.3). At all points (x, y) on 
this isocline the line elements have the same slope c 0 and hence the same inclination 
a 0 = arctan c 0 , 0° < a 0 < 180°. At a series of points along this isocline construct line 
elements having this inclination a 0 . 

3. Repeat Step 2 for each of the isoclines of the family (8.3) constructed in Step 1. In 
this way the line element configuration begins to take shape. 

4. Finally, draw the smooth curves (the approximate integral curves) indicated by 
the line element configuration obtained in Step 3. 


► Example 8.2 

Employ the method of isoclines to sketch the approximate integral curves of 

% = 2 * + y- < 8 - 2 > 

Solution. We have already noted that the isoclines of the differential equation (8.2) 
are the straight lines 2x + y = c or 


y = — 2x + c. 


(8.4) 
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Figure 8.3 


InFigure 8.3 we construct these lines fore = — 2, — §, - 1, — |,0,y, 1,§, and 2. On each 
of these we then construct a number of line elements having the appropriate inclination 
a = arctan c, 0° < a < 180°. For example, for c = 1, the corresponding isocline is y = 
— 2x+ 1, and on this line we construct line elements of inclination arctan 1 = 45°. In 
the figure the isoclines are drawn with dashes and several of the approximate integral 
curves are shown (drawn solidly). 

► Example 8.3 

Employ the method of isoclines to sketch the approximate integral curves of 

d -f = X 2 + y\ (8.5) 

ax 

Solution. The isoclines of the differential equation (8.5) are the concentric circles 
x 2 + y 2 = c, c > 0. In Figure 8.4 the circles for which c = y£, *, 1, f§, and 4 

have been drawn with dashes, and several line elements having the appropriate 
inclination have been drawn along each. For example, for c = 4, the corresponding 
isocline is the circle x 2 + y 2 = 4 of radius 2, and along this circle the line elements have 
inclination arctan 4 % 76°. Several approximate integral curves are shown (drawn 
solidly). 



Figure 8.4 
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8.2 POWER SERIES METHODS 
A. Introduction 

Although the graphical methods of the preceding section are very general, they suffer 
from several serious disadvantages. Not only are they tedious and subject to possible 
errors of construction, but they merely provide us with the approximate graphs of the 
solutions and do not furnish any analytic expressions for these solutions. Although we 
have now passed the naive state of searching for a closed-form solution in terms of 
elementary functions, we might still hope for solutions that can be represented as some 
type of infinite series. In particular, we shall seek solutions that are representable as 
power series. 

We point out that not all first-order differential equations possess solutions that can 
be represented as power series, and it is beyond the scope of this book to consider 
conditions under which a first-order differential equation does possess such solutions. 
In order to explain the power series methods We shall assume that power series 
solutions actually do exist, realizing that this is an assumption that is not always 
justified. 

Specifically, we consider the initial-value problem consisting of the differential 
equation 

< 8i) 


and the initial condition 


y(*o) = ^0 (8.6) 

and assume that the differential equation (8.1) possesses a solution that is representable 
as a power series in powers of (x — x 0 ). That is, we assume that the differential equation 
(8.1) has a solution of the form 


00 

y = c 0 + c t (x - x 0 ) + c 2 (x - x 0 ) 2 + • • • = £ c„(x - x 0 f (8.7) 

n = 0 

that is valid in some interval about the point x 0 . We now consider methods of 
determining the coefficients c 0 , c u c 2 ,. . . in (8.7) so that the series (8.7) actually does 
satisfy the differential equation (8.1). 


B. The Taylor Series Method 


We thus assume that the initial-value problem consisting of the differential equation 
(8.1) and the initial condition (8.6) has a solution of the form (8.7) that is valid in some 
interval about x 0 . Then by Taylor’s theorem, for each x in this interval the value y(x) of 
this solution is given by 

y{x) = y(x 0 ) + /(x 0 )(x - x 0 ) + (x - x 0 ) 2 + • • • 


y /^(Xq) 

n = 0 n\ 


(x - x 0 ) n . 


( 8 . 8 ) 
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From the initial condition (8.6), we have 

y(x 0 ) = y<» 

and from the differential equation (8.1) itself, 

/(*o) = /(*o, yo)- 

Substituting these values of y(x 0 ) and /(x 0 ) into the series in (8.8), we obtain the first 
two coefficients of the desired series solution (8.7). Now differentiating the differential 
equation (8.1), we obtain 

= fx(x, y) + fy(x, y)f(x, y), (8.9) 

where we use subscripts to denote partial differentiations. From this we obtain 
/'(Xo) = fx(x 0 , yo) + fy(x 0, yo)f(x 0 ,yo\ 

Substituting this value of y"(x 0 ) into (8.8), we obtain the third coefficient in the series 
solution (8.7). Proceeding in like manner, we differentiate (8.9) successively to obtain 

d 3 y d*y d n y 
dx 3 ’dx 4 ’ ’dx"’ 

From these we obtain the values 

Substituting these values into (8.8), we obtain the fourth and following coefficients in 
the series solution (8.7). Thus the coefficients in the series solution (8.7) are successively 
determined. 


► Example 8.4 

Use the Taylor series method to obtain a power series solution of thelnitial-value 
problem 

% = x 2 + y 2 , (8.10) 

dx 

y(0) = i, (8.ii) 

in powers of x. 


Solution. Since we seek a solution in powers of x, we set x 0 = 0 in (8.7) and thus 
assume a solution of the form 


y = c 0 + c t x + c 2 x 2 + • • • = Y, 


n = 0 


By Taylor’s theorem, we know that for each x in the interval where this solution is valid 

y(x) = y(0) + y'(0)x+^x 2 + ---= £ x". 

2! „=o n\ 


2 ! 


( 8 . 12 ) 
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The initial condition (8.1 1) states that 


y(0) = i, 

(8.13) 

and from the differential equation (8.10) we see that 


/(0) = 0 2 + l 2 = 1. 

(8.14) 

Differentiating (8.10) successively, we obtain 


d 2 y , . dy 

d^ = 2x + 2y d^’ 

(8.15) 

d 3 y m d2y 4 - if dy Y 

dx 3 2 2y dx 2 2 \dx J ’ 

(8.16) 

d A y - d 3 y dy d 2 y 
dx 4 ^ dx 3 dx dx 2 

(8.17) 


dy 

Substituting x = 0, y = 1, — = 1, into (8.15), we obtain 


/'(0) = 2(0) + 2(1)(1) = 2. (8.18) 

Substituting v = 1, = 1, = 2 into (8.16), we obtain 

dx dx z 

/"( 0) = 2 + 2(1)(2) + 2(1) 2 = 8. (8.19) 

Finally, substituting y = 1, ^ = 1, = 2, ^y = 8 into (8.17), we find that 

/ iv) ( 0) = (2)(1)(8) + (6)(1)(2) = 28. (8.20) 
By successive differentiation of (8.17), we could proceed to determine 

d 5 y d 6 y 
dx 5 ' dx 6 '"' 

and hence obtain 

y v) (0), y vi) (0),--- 

Now substituting the values given by (8. 1 3), (8. 1 4), (8. 1 8), (8. 1 9), and (8.20) into (8. 1 2), we 
obtain the first five coefficients of the desired series solution. We thus find the solution 

i 2 2 8 3 28 4 

,=, +x+ - x - >+-* •+- 

4 7 

= 1 -+-x-bx 2 -b — x 3 -b — x 4 + * * *. (8.21) 

3 6 

C. The Method of Undetermined Coefficients 

We now consider an alternative method for obtaining the coefficients c 0 , c t , c 2 , . . . in 
the assumed series solution 

oo 

y = c 0 + c t (x - x 0 ) + C 2 (x - x 0 ) 2 + ■■■ = £ c n (x - x 0 f 

n = 0 


(8.7) 
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of the problem consisting of the differential equation (8.1) with initial condition (8.6). 
We shall refer to this alternative method as the method of undetermined coefficients. In 
order to apply it we assume that / (x, y) in the differential equation (8.1) is representable 
in the form 

f{x, y) = a 00 + a l0 (x - x 0 ) + a 01 (y - y 0 ) + « 2 o(* - *o) 2 

+ a, ,(x - x 0 )()> - y 0 ) + a 02 (y - y 0 ) 2 + ' ' ' • (8.22) 


The coefficients a V) in (8.22) may be found by Taylor’s formula for functions of two 
variables, although in many simple cases the use of this formula is unnecessary. Using 
the representation (8.22) for /(x, y), the differential equation (8.1) takes the form 



oo 


+ a 10 (x - x 0 ) + a 0l (y - y 0 ) + a 20 (x - x 0 ) 2 


+ a u (x-x 0 )(y->'o) + a 02 (y-y 0 ) 2 


(8.23) 


Now assuming that the series (8.7) converges in some interval |x — x 0 | < r(r > 0) 
about x 0 , we may differentiate (8.7) term by term and the resulting series will also 
converge on |x — x 0 | < r and represent y'(x) there. Doing this we thus obtain 


dy_ 

dx 


= c t + 2 c 2 (x 


x 0 ) + 3 c 3 (x - x 0 ) 2 + • • •. 


(8.24) 


We note that in order for the series (8.7) to satisfy the initial condition (8.6) that y = y 0 
at x = x 0 , we must have c 0 = y 0 and hence 

y ~ J'o = c i( x ~ x o ) + c 2 (* - x 0 ) 2 + • • • . (8.25) 

Now substituting (8.7) and (8.24) into the differential equation (8.23), and making use 
of (8.25), we find that 

+ 2c 2 (x - x 0 ) + 3 c 3 (x - x 0 ) 2 + ••• 

= a 00 + a 10 (x - x 0 ) + a 0 i[ c i(x - x 0 ) + c 2 (x - x 0 ) 2 + •] 

+ a 20 (x - x 0 ) 2 + a n (x - x 0 )[c!(x - x 0 ) + c 2 (x - x 0 ) 2 + •••] 

+ a 02 icfx - x 0 ) + c 2 (x - x 0 ) 2 + •••] 2 + '••• (8-26) 

Performing the multiplications indicated in the right member of (8.26) and then 
combining like powers of (x — x 0 ), we see that (8.26) takes the form 

c t + 2c 2 (x — x 0 ) + 3 c 3 (x — x 0 ) 2 + • • • 

= ^oo + ( a io “h a oi c i)( x ~~ x o) 

+ (a 0l c 2 + a 20 + a u c t + a 02 c?)(x - x 0 ) 2 + •••. (8.27) 

In order that (8.27) be satisfied for all values of x in the interval |x — x 0 | < r, the 
coefficients of like powers of (x — x 0 ) on both sides of (8.27) must be equal. Equating 
these coefficients, we obtain 


c i = a oo 9 

2 c 2 — Ci l0 + doi c i ? 

3c 3 = d 0l c 2 + d 2 o + + d 02 cl , 


(8.28) 
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From the conditions (8.28) we determine successively the coefficients Cj , c 2 , c 3 , . . . of the 
series solution (8.7). From the first of conditions (8.28) we first obtain c l as the known 
coefficient a 00 . Then from the second of conditions (8.28) we obtain c 2 in terms of the 
known coefficients a l0 and a 0l and the coefficient c t just determined. Thus we obtain 
c 2 = i( a io + ^oi^oo)- I n 'ike manner, we proceed to determine c 3 ,c 4 ,.... We observe 
that in general each coefficient c n is thus given in terms of the known coefficients a {j in 
the expansion (8.22) and the previously determined coefficients c l ,c 2 ,...,c„- l . 

Finally, we substitute the coefficients c 0 , c l9 c 2 , . . . so determined into the series (8.7) 
and thereby obtain the desired solution. 


► Example 8.5 

Use the method of undetermined coefficients to obtain a power series solution of the 
initial-value problem 

^ = X I 2 + >> 2 , (8.10) 

y(0) = l, (8.11) 

in powers of x. 

Solution. Since x 0 = 0, the assumed solution (8.7) is of the form 

y = c 0 4 c^x + c 2 x 2 + c 3 x 3 4 • • • . (8.29) 

In order to satisfy the initial condition (8.1 1), we must have c 0 = 1 and hence the series 
(8.29) takes the form 

y = 1 4 c t x 4- c 2 x 2 4 c 3 x 3 4 ••*. (8.30) 

Differentiating (8.30), we obtain 

^ = c t 4 2c 2 x 4 3c 3 x 2 4 4c 4 x 3 4 * * * . (8.31) 

For the differential equation (8.10) we have /(x, y) = x 2 4 y 2 . Since x 0 = 0 and 
y 0 = 1, we must expand x 2 4 y 2 in the form 

I a u x\y - 1) J '. 
u = o 

Since 

y 2 = (y - l) 2 + 2(y — l) 4 l, 
the desired expansion is given by 

x 2 4 y 2 = 1 4 2(y - 1) 4 x 2 4 (y - l) 2 . 

Thus the differential equation (8.10) takes the form 


/ =U2(y-l)4x 2 4(y-l) 2 


(8.32) 
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Now substituting (8.30) and (8.31) into the differential equation (8.32), we obtain 
c i + 2 c 2 x + 3c 3 x 2 + 4 c 4 x 3 + ••• 

= 1 + 2(cjX + c 2 x 2 + c 3 x 3 + •••) + x 2 + (c l x + c 2 x 2 + • • *) 2 - (8.33) 

Performing the indicated multiplications and collecting like powers of x in the right 
member of (8.33), we see that it takes the form 

c t + 2c 2 x + 3c 3 x 2 + 4c 4 x 3 + ••• 

= 1 + 2c y x + (2c 2 + 1 + c 2 )x 2 + (2c 3 + 2c 1 c 2 )x 3 + •••. (8.34) 

Equating the coefficients of the like powers of x in both members of (8.34), we obtain 
the conditions 


Ci = 1, 

2c 2 = 2c 1? 

3c 3 - 2c 2 + 1 + c 2 , 
4c 4 = 2c 3 + 2c x c 2 , 


(8.35) 


From the conditions (8.35), we obtain successively 

Ci = 1, 

c 2 =Ci = 1, 

C 3 = 3(2c 2 + 1 + c\) = f, 

C 4 — i(2c 3 + 2 ciC 2 ) = 4 (^) = 6, 


(8.36) 


Substituting the coefficients so determined in (8.36) into the series (8.30), we obtain the 
first five terms of the desired series solution. We thus find 

y= 1 +x + x 2 +fx 3 + ^x 4 + ---. 

We note that this is of course the same series previously obtained by the Taylor series 
method and already given by (8.21). 


Remark. We have made but little mention of the interval of convergence of the 
series involved in our discussion. We have merely assumed that a power series solution 
exists and converges on some interval |x — x 0 | < r(r > 0) about the initial point x 0 . In 
a practical problem the interval of convergence is of vital concern and should be 
determined, if possible. Another matter of great importance in a practical problem is 
the determination of the number of terms which have to be found in order to be certain 
of a sufficient degree of accuracy. We shall not consider these matters here. Our 
primary purpose has been merely to explain the details of the methods. We refer the 
reader to more advanced treatises for discussions of the importance questions of 
convergence and accuracy. 
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Exercises 


Obtain a power series solution in powers of x of each of the initial-value problems in 
Exercises 1-7 by (a) the Taylor series method and (b) the method of undetermined 
coefficients. 


d i- x+ >- 

3. %=' + V, y( o) = 2. 

5. = x + sin y, 3/(0) = 0. 

ax 

dy 

7. — = e x + x cos y, y(0) = 0. 


2. ^ = x 2 + 2 y 2 , 3/(0) = 4. 

4. ^ = x 3 + y 3 , y(0) = 3. 

ax 

dy 

6. — = 1 + x sin y, y(0) = 0. 


Obtain a power series solution in powers of x — 1 of each of the initial-value problems 
in Exercises 8-12 by (a) the Taylor series method and (b) the method of undetermined 
coefficients. 


8. 

^ = x 2 +y 2 , y( 1) = 4. 

9. ^ = x 3 + y 2 , 
dx 

3/(1)= 1. 

10. 

dy t 

J- = X + y + y\ 3/(1) =1. 

dy 

11. — = x + cos y, 
dx 

3/(1) = it. 

12. 

dy , . ... n 

— = x 2 + xsm y, y( 1) = -. 




8.3 THE METHOD OF SUCCESSIVE APPROXIMATIONS 
A. The Method 

We again consider the initial-value problem consisting of the differential equation 

£ = /(*,>0 ( 8 - 1 ) 

and the initial condition 

y(xo) = yo- (8.6) 

We now outline the Picard method of successive approximations for finding a solution 
of this problem which is valid on some interval that includes the initial point x 0 . 

The first step of the Picard method is quite unlike anything that we have done before 
and at first glance it appears to be rather fruitless. For the first step actually consists of 
making a guess at the solution! That is, we choose a function </> 0 and call it a “zeroth 
approximation” to the actual solution. How do we make this guess? In other words, 
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what function do we choose? Actually, many different choices could be made. The only 
thing that we know about the actual solution is that in order to satisfy the initial 
condition (8.6) it must assume the value y 0 at x = x 0 . Therefore it would seem 
reasonable to choose for (j) 0 a function that assumes this value y 0 at x = x 0 . Although 
this requirement is not essential, it certainly seems as sensible as anything else. In 
particular, it is often convenient to choose for <j> 0 the constant function that has the 
value y 0 for all x. While this choice is certainly not essential, it is often the simplest, most 
reasonable choice that quickly comes to mind. 

In summary, then, the first step of the Picard method is to choose a function (f) 0 which 
will serve as a zeroth approximation. 

Having thus chosen a zeroth approximation </> 0 , we now determine a first ap- 
proximation (j) t in the following manner. We determine (^(x) so that (1) it satisfies the 
differential equation obtained from (8.1) by replacing y in /(x, y) by </> 0 (x), and (2) it 
satisfies the initial condition (8.6). Thus </> x is determined such that 

^-[<M*)]=/IX<M*)] (8-37) 


and 


<M*o) = yo- (8-38) 

We now assume that /[x, </> 0 (x)] is continuous. Then </>! satisfies (8.37) and (8.38) if and 
only if 

<t>i(x) = y 0 + f f[t,<Po(t)]dt. (8.39) 

Jxo 

From this equation the first approximation (j) l is determined. 

We now determine the second approximation </> 2 in a similar manner. The function 
</> 2 is determined such that 

4 - [<M*)] = fix, <Mx)] (8.40) 


and 


02(xo) = }V (8-41) 

Assuming that /[x,^)*)] is continuous, then <p 2 satisfies (8.40) and (8.41) if and 
only if 


<t>2(x) = >>0 + 


X Rt, <Mf)] dt. 

XQ 


(8.42) 


From this equation the second approximation <j) 2 is determined. 

We now proceed in like manner to determine a third approximation </> 3 , a fourth 
approximation </> 4 , and so on. The nth approximation </>„ is determined from 

<t>n(x) = yo + [ /[t,0„-i(O] dt, (8.43) 

Jxo 

where </>„_! is the (n — l)st approximation. We thus obtain a sequence of functions 





392 APPROXIMATE METHODS OF SOLVING FIRST-ORDER EQUATIONS 


where 0 O is chosen, (j) l is determined from (8.39), 0 2 is determined from (8.42), . . . , and in 
general 0„ is determined from (8.43) for n > 1. 

Now just how does this sequence of functions relate to the actual solution of the 
initial-value problem under consideration? It can be proved, under certain general 
conditions and for x restricted to a sufficiently small interval about the initial point x 0 , 
that (1) as n oo the sequence of functions 0„ defined by (8.43) for n > 1 approaches a 
limit function 0, and (2) this limit function 0 satisfies both the differential equation (8.1) 
and the initial condition (8.6). That is, under suitable restrictions the function 0 defined 
by 


0 = lim 0„ 

n-> oo 

is the exact solution of the initial-value problem under consideration. Furthermore, the 
error in approximating the exact solution 0 by the nth approximation 0„ will be 
arbitrarily small provided that n is sufficiently large and that x is sufficiently close to the 
initial point x 0 . 


B. An Example; Remarks on The Method 

We illustrate the Picard method of successive approximations by applying it to the 
initial-value problem of Examples 8.4 and 8.5. 


► Example 8.6 


Use the method of successive approximations to find a sequence of functions that 
approaches the solution of the initial-value problem 


dy_ 

dx 


= x 2 + y 2 , 


( 8 . 10 ) 


y(0) = l. 


( 8 . 11 ) 


Solution. Our first step is to choose a function for the zeroth approximation 0 O . 
Since the initial value of y is 1, it would seem reasonable to choose for 0 O the constant 
function that has the value 1 for all x. Thus, we let 0 O be such that 

0o(x) = 1 

for all x. The nth approximation 0„ for n > 1 is given by formula (8.43). Since 

f(x, y) = X 2 + y 2 

in the differential equation (8.10), the formula (8.43) becomes in this case 

{t 2 + [<£„-i(t)] 2 } dt, n> 1. 


<t>„(x) = 1 + 


0 
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Using this formula for n = 1, 2, 3, . . . , we obtain successively 
0iM = 1 + f {t 2 + [0 o (O] 2 } dt 


r 

r 


= 1 + (t 2 + 1) dt — 1 + x H — — , 


<£ 2 (x) =1+ {t 2 + [<M0] 2 } dt = 1 + 


£ 

£ 


£ 


,3\2 


t + ( 1 + ? + 


dt 


1 + It + It 2 + — + 


2 t 3 2 t 4 t 6 


+ ■ 


dt 


2x 3 


2x 5 


= 1+x + x +_ + _ + _ + _ 
<f> 3 {x) =1+1 {t 2 + [</> 2 (r)] 2 } dt 


= 1 + 

= 1 + 


£ 

£ 

£ 


t 2 + i + 1 + 1 2 + 


2t 3 t 4 2 1 5 t J Y 


+ 


+ TT 


6 15 63/ 


dt 


' , , , lOr 3 8t 4 29t 5 47f 6 1 64f 7 

1 + 2t + 4t 2 + — + — + — + — + — — 

3 3 15 45 315 


299t 8 8 t 9 184 t 10 

-j~ -J- -j- — -j- 

1260 105 4725 


AY t 14 1 
189 + 945 + 3969 ] 


= 1 + X + X 2 + 


4x 3 5x 4 8x 5 29x 6 47x 7 41x 8 


■ + ■ 


+ 


15 


+ 


90 


+ 


315 


+ 


299x 9 


630 11,340 


+ 


4x 10 184x u x 12 


+ 


4x 13 x 
+ -— + 


15 


525 51,975 2268 12,285 59,535 


We have chosen the zeroth approximation </> 0 and then found the next three ap- 
proximations 02, and 03 explicitly. We could proceed in like manner to find 0 4 , 
0 5 , . . . explicitly and thus determine successively the members of the sequence {0„} that 
approaches the exact solution of the initial-value problem under consideration. 
However, we believe that the procedure is now clear, and for rather obvious reasons we 
shall not proceed further with this problem. 

Remarks. The greatest disadvantage of the method of successive approximations is 
that it leads to tedious, involved, and sometimes impossible calculations. This is amply 
illustrated in Example 8.6. At best the calculations are usually very complicated, and in 
general it is impossible to carry through more than a few of the successive integrations 
exactly. Nevertheless, the method is of practical importance, for the first few ap- 
proximations alone are sometimes quite accurate. 

However, the principal use of the method of successive approximations is in proving 
existence theorems. Concerning this, we refer the reader to Chapter 10. 
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Exercises 

For each of the initial-value problems in Exercises 1-8 use the method of successive 
approximations to find the first three members </> l5 <j> 2 , <t> 3 of a sequence of functions 
that approaches the exact solution of the problem. 



= l. 

2. 


o 

II 


O 

II 

O 

4. 


II 

p 


o 

II 

o 

6. 

dy . 2 

— = sinx + v 
dx 

, y(0) = o. 


y(0) = o. 

8. 

j- = 1 + 6xy 4 , 
dx 

O 

II 


8.4 NUMERICAL METHODS 

A. Introduction; A Problem for Illustration 

In this section we introduce certain basic numerical methods for approximating the 
solution of the initial-value problem consisting of the differential equation 

and the initial condition 


y(xo) = ( 8 - 6 ) 

Numerical methods employ the differential equation (8.1) and the condition (8.6) to 
obtain approximations to the values of the solution corresponding to various, selected 
values of x. To be more specific, let y denote the solution of the problem and let h 
denote a positive increment in x. The initial condition (8.6) tells us that y = y 0 at x = x 0 . 
A numerical method will employ the differential equation (8.1) and the condition (8.6) 
to approximate successively the values of y at x t = x 0 + h,x 2 = x t + h,x 3 = 
x 2 + h , . . . . 

Let us denote these approximate values of y by y l5 y 2 , y 3 , . . . , respectively. That is, we 
let y n denote the approximate value of y at x = x„ = x 0 + nh, n = 1, 2, 3, ... . Now all 
that we know about y before starting is that y = y 0 at x = x 0 . In order to get started we 
need a method that requires only the value y n in order to obtain the next value y n + 1 . For 
we can apply such a method with n = 0 and use the initial value y 0 to obtain the next 
value y l . A method that uses only y n to find y n + l , and that therefore enables us to get 
started, is called a starting method . Once we have used a starting method to find y t , we 
can repeat it with n = 1 to find y 2 , with n = 2 to find y 3 , and so forth. However, once we 
have several calculated values at our disposal, it is often convenient to change over to a 
method that uses both y n and one or more preceding values y n _ l9 y n _ 2 , ... to find the 
next value y n + l . Such a method, which enables us to continue once we have got 
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sufficiently well started, is called a continuing method. Most of our attention in this text 
will be devoted to starting methods. 

Our principal objective in this section is to present the actual details of certain basic 
numerical methods for solving first-order initial-value problems. In general, we shall 
not consider the theoretical justifications of these methods, nor shall we enter into 
detailed discussions of matters such as accuracy and error. 

Before turning to the details of the numerical methods to be considered, we 
introduce a simple initial-value problem that we shall use for purposes of illustration 
throughout this section. We consider the problem 

( jl = 2x + y, (8.2) 

y(0) = 1. (8-44) 

We have already employed the differential equation (8.2) to illustrate the graphical 
methods of Section 8.1. We note at once that it is a linear differential equation and 
hence it can be solved exactly. Using the methods of Section 2.3, we find at once that its 
general solution is 

y=- 2(x+ 1 ) + ce x , (8.45) 

where c is an arbitrary constant. Applying the initial condition (8.44) to (8.45), we find 
that the exact solution of the initial-value problem consisting of (8.2) and (8.44) is 

y= -2(x+ 1) + 3e x . (8.46) 

We have chosen the problem consisting of (8.2) and (8.44) for illustrative purposes for 

two reasons. First, the differential equation (8.2) is so simple that numerical methods 
may be applied to it without introducing involved computations that might obscure 
the main steps of the method to a beginner. Second, since the exact solution (8.46) of the 
problem has been found, we can compare approximate solution values obtained 
numerically with this exact solution and thereby gain some insight into the accuracy of 
numerical methods. 

Of course in practice we would not solve a simple linear differential equation such as 
(8.2) by a numerical method. The methods of this section are actually designed for 
equations that can not be solved exactly and for equations that, although solvable 
exactly, have exact solutions so unwieldy that they are practically useless. 


B. The Euler Method 

The Euler method is very simple but not very practical. An understanding of it, 
however, paves the way for an understanding of the more practical (but also more 
complicated) methods which follow. 

Let y denote the exact solution of the initial-value problem that consists of the 
differential equation 

% = f(x ’ y) <8I) 

and the initial condition 


y(*o) = yo- 


(8.6) 
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Let h denote a positive increment in x and let x t = x 0 + h. Then 

C x i A 

fix, y) dx = ~^ dx = ^l) - y( x o)‘ 

%) XO %) Xq 

Since y 0 denotes the value y(x 0 ) of the exact solution y at x 0 , we have 


r 


y(*i) = yo+ f(x,y)dx. 


(8.47) 


If we assume that f(x, y ) varies slowly on the interval x 0 < x < x t , then we can 
approximate f(x, y) in (8.47) by its value f(x 0 , y 0 ) at the left endpoint x 0 . Then 


y(*i) « y 0 + 


P“ fix o, 

Jxo 


yo) dx - 


But 


I' 


fix o, ^o) dx = f(x 0 , y 0 )(x t - x 0 ) = hf(x 0 , y 0 )- 


Thus 


y(*i)~yo + hf(x 0 ,y 0 ). 

Thus we obtain the approximate value y { of y at x x = x 0 + h by the formula 

yi = yo + ¥(x 0 ,yo)- (8.48) 

Having obtained y i by formula (8.48), we proceed in like manner to obtain y 2 by the 
formula ^2 = ^1 + hf(x i9 y 3 by the formula y 3 = y 2 + hf(x 2 , y 2 \ and so forth. In 
general we find y n+ 1 in terms of y n by the formula 

y»+i = y« + Wix n ,y n )- (8.49) 



Figure 8.5 
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Before illustrating the method, we give a useful geometric interpretation. The graph 
of the exact solution y is a curve C in the xy plane (see Figure 8.5). Let P denote the ini- 
tial point (x 0 , y 0 ) and let T be the tangent to C at P. Let Q be the point at which the line 
x = intersects C and let N be the point at which this line intersects T. Then the exact 
value of y at x t is represented by LQ. The approximate value y x is represented by LN, 
since LN = LM + MN = y 0 + PM tan 6 = y 0 + hf(x 0 , y 0 ). The error in approxi- 
mating the exact value of y at x t by y i is thus represented by NQ. The figure suggests 
that if h is sufficiently small, then this error NQ will also be small and hence that the 
approximation will be good. 

It is clear that the Euler method is indeed simple, but it should also be apparent from 
our discussion why it is not very practical. If the increment h is not very small, then the 
errors in the approximations generally will not be small and thus the method will lead 
to quite inaccurate results. If the increment h is very small, then the computations will 
be more lengthy and so the method will involve tedious and time-consuming labor. 


► Example 8.7 

Apply the Euler method to the initial-value problem 

^ = 2 x + y, (8.2) 

y(0) = 1. (8.44) 

Employ the method to approximate the value of the solution y at x = 0.2, 0.4, 0.6, 0.8, 
and 1.0using(l)/z = 0.2,and(2)Ji = 0.1. Obtain results to three figures after the decimal 
point. Compare with the exact value determined from (8.46). 

Solution. 1. We use formula (8.49) with /(x, y) = 2x + y and h = 0.2. From the 
initial condition (8.44), we have x 0 = 0, y 0 = 1. We now proceed with the calculations. 

(a) x t = x 0 + h = 0.2, /(x 0 , y 0 ) = /( 0, 1) = 1.000, 

y i = yo + hf(x o, y 0 ) = 1.000 + 0.2(1.000) = 1.200 

(b) x 2 = x t + h = 0.4, f(x l , y L ) = /(0.2, 1.200) = 1.600, 
y 2 = y t + hf(x i9 y t ) = 1.200 + 0.2(1.600) = 1.520. 

(c) x 3 = x 2 + h = 0.6, f(x 2 ,y 2 ) = /( 0.4, 1.520) = 2.320, 
y 3 = y 2 + hf (x 2 , y 2 ) = 1.520 + 0.2(2.320) = 1.984. 

(d) x 4 = x 3 + h = 0.8, /(x 3 , y 3 ) = /( 0.6, 1.984) = 3.184, 

^ = ^3 + hf(x 3 ,y 3 ) = 1.984 + 0.2(3.184) = 2.621. 

(e) x 5 = x 4 + h = 1.0, /(x 4 , y 4 ) = /( 0.8, 2.621) = 4.221, 
y 5 = y 4 + hf(x 4 , y 4 ) = 2.621 + 0.2(4.221) = 3.465. 

These results, corresponding to the various values of x„, are collected in the second 
column of Table 8.2. 

2. We now use formula (8.49) with / (x, y) = 2x + y and h = 0. 1 . Of course we again 
have x 0 = 0, y 0 = 1. The calculations follow. 

(a) Xl =x 0 + h = 0.1, /(x 0 , y 0 ) = /( 0, 1) = 1.000, 
y± — y 0 + hf(x 0 , y 0 ) = 1.000 + 0.1(1.000) = 1.100. 
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TABLE 8.2 




y n 



using h = 0.2 

using h = 0. 1 

.V 

0.0 

1.000 

1.000 

1.000 

0.1 

— 

1.100 

1.116 

0.2 

1.200 

1.230 

1.264 

0.3 

— 

1.393 

1.450 

0.4 

1.520 

1.592 

1.675 

0.5 

— 

1.831 

1.946 

0.6 

1.984 

2.114 

2.266 

0.7 

— 

2.445 

2.641 

0.8 

2.621 

2.830 

3.076 

0.9 

— 

3.273 

3.579 

1.0 

3.465 

3.780 

4.155 


(b) x 2 = x l + h = 0.2, f(x u y t ) = /( 0.1, 1.100) = 1.300, 
y 2 = yi + hf(x u y,) = 1.100 + 0.1(1.300) = 1.230. 

(c) x 3 = x 2 + h = 0.3, f(x 2 , y 2 ) = /( 0.2, 1.230) = 1.630, 
y 3 = Vi + hf(x 2 , y 2 ) = 1-230 + 0.1(1.630) = 1.393. 

(d) x 4 = x 3 + h = 0.4, f(x 3 ,y 3 ) = /( 0.3, 1.393) = 1.993, 
y 4 = .V 3 + hf(x 3 ,y 3 ) = 1.393 + 0.1(1.993) = 1.592. 

(e) x 5 = x 4 + h = 0.5, f(x 4 , y 4 ) = /(0.4, 1.592) = 2.392, 
y 5 = y 4 + ¥(* 4 , y 4 ) = 1-592 + 0.1(2.392) = 1.831. 

(f) x 6 = x 5 + h = 0.6, /(x 5 , y 5 ) = /( 0.5, 1.831) = 2.831, 
y 6 = y 5 + hf(x 5 , y s ) = 1.831 + 0.1(2.831) = 2.114. 

(g) x 7 = x 6 + h = 0.7, /(x 6 , y 6 ) = /( 0.6, 2.1 14) = 3.314, 
y-! = y 6 + ¥( x e > ye) = 2.114 + 0.1(3.314) = 2.445. 

(h) x 8 = x 7 + h = 0.8, f(x 7 ,y 7 ) = /(0.7, 2.445) = 3.845, 
y 8 = y 7 + hf(x 7 , y 7 ) = 2.445 + 0.1(3.845) = 2.830. 

(i) x 9 = x 8 + h = 0.9, f(x 8 , y s ) = /( 0.8, 2.830) = 4.430, 
y 9 = y 8 + li/(x 8 , y 8 ) = 2.830 + 0.1(4.430) = 3.273. 

(j) x 10 = x 9 + h = 1.0, f(x 9 , y 9 ) = /( 0.9, 3.273) = 5.073, 
y 10 = y 9 + hf(x 9 , y 9 ) = 3.273 + 0.1(5.073) = 3.780. 


These results are collected in the third column of Table 8.2. The values of the exact 
solution y, computed from (8.46) to three figures after the decimal point, are listed in the 
fourth column of Table 8.2. From this table we compute the errors involved in both 
approximations at x = 0.2, 0.4, 0.6, 0.8, and 1 .0. These errors are tabulated in Table 8.3. 

A study of these tables illustrates two important facts concerning the Euler method. 
First, for a fixed value of h, the error becomes greater and greater as we proceed over a 
larger and larger range away from the initial point. Second, for a fixed value of x„, the 
error is smaller if the value of h is smaller. 
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TABLE 8.3 


*» 

Error 

using h = 0.2 

Error 

using h = 0.1 

0.2 

0.064 

0.034 

0.4 

0.155 

0.083 

0.6 

0.282 

0.152 

0.8 

0.455 

0.246 

1.0 

0.690 

0.375 


Exercises 


1. Consider the initial-value problem 

dy » 

Tx = x ~ lh 

y(0) = i. 

(a) Apply the Euler method to approximate the values of the solution y at x = 
0.1, 0.2, 0.3, and 0.4, using h = 0.1. Obtain results to three figures after the 
decimal point. 

(b) Proceed as in part (a) using h = 0.05. 

(c) Find the exact solution of the problem and determine its values at x = 0. 1 , 0.2, 
0.3, and 0.4 (to three figures after the decimal point). 

(d) Compare the results obtained in parts (a), (b), and (c). Tabulate errors as in 
Table 8.3. 

2. Proceed as in Exercise 1 for the initial-value problem 

dy 

Tx= x + y ’ 

m = 2 . 


C. The Modified Euler Method 


In Section B we observed that the value ^(xj of the exact solution y of the initial value 
problem 


(8.1) 

y(* 0 ) = yo. 

(8.6) 

at = x 0 + h is given by 


y(*i) = + /(*, jO dx. 

JXQ 

(8.47) 


In the Euler method we approximated /(x, y) in (8.47) by its value /(x 0 , y 0 ) at the left 
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endpoint of the interval x 0 < x < x t and thereby obtained the approximation 

y t = y 0 + ¥(x 0 ,y 0 ) (8.48) 

for y at x t . It seems reasonable that a more accurate value would be obtained if we were 
to approximate /(x, y) by the average of its values at the left and right endpoints of 
x 0 < x < x l5 instead of simply by its value at the left endpoint x 0 . This is essentially 
what is done in the modified Euler method, which we shall now explain. 

In order to approximate /(x, y) by the average of its values at x 0 and x l9 we need to 
know its value /[x l9 y^)] at x^ However, we do not know the value y(xj of y at x t . 
We must find a first approximation y[ i] for y^), and tQ do this we proceed just as we 
did at the start of the basic Euler method. That is, we take 

y[ l) = y<> + ¥(x 0 ,y 0 ) (8.50) 

as the first approximation to the value of y at x^ Then we approximate /[x l5 y^)] by 
/(x i, y^), using the value y[ l) found by (8.50). From this we obtain 

/(*o>yo) + /(*i,y ( i 1> ) ( 8 51 j 


which is approximately the average of the values of /(x, y) at the endpoints x 0 and x t . 
We now replace /(x, y) in (8.47) by (8.51) and thereby obtain 


, f{x 0 ,y 0 ) + f(x lt y\ 

XI — XO "T ^ 


Ob 


(8.52) 


as the second approximation to the value of y at x^ 

We now use the second approximation y^ ] to obtain a second approximation 
/(x l5 y (2) ) for the value of /(x, y) at x x . From this we proceed to obtain 


y i 3) = y 0 + 


f(x 0 ,yo) + f(xi,y ( n 


(8.53) 


as the third approximation to the value of y at x t . Proceeding in this way we obtain a 
sequence of approximations 

v (l) v (2) y(3) 

XI 5X1 »Xl 5- • 


to the value of the exact solution y at x x . We proceed to compute the successive 
members of the sequence until we encounter two consecutive members that have the 
same value to the number of decimal places required. We take the common value of 
these two consecutive members as our approximation to the value of the solution y at 
x t and denote it by y x . 

Having finally approximated y at x t by y l5 we now move on and proceed to 
approximate y at x 2 = x t + h. We proceed in exactly the same way as we did in finding 
y l . We find successively 

y ( 2 ) = yi + ¥(xi,yi). 


y<?> = y l+ f{X y'y - 0 . + fiX >’ y( £ lh, 


3,(3. = yi + f^uyA + fy^l K 


(8.54) 


2 
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until two consecutive members of this sequence agree, thereby obtaining an ap- 
proximation y 2 to the value of y at x 2 . 

Proceeding further in like manner one obtains an approximation y 3 to the value of y 
at x 3 , and so forth. 


► Example 8.8 

Apply the modified Euler method to the initial-value problem 

d f = 2x + y, (8.2) 

ax 

y( 0)= I- (8.44) 

Employ the method to approximate the value of the solution y at x = 0.2 and x = 0.4 
using h = 0.2. Obtain results to three figures after the decimal point. Compare with the 
results obtained using the basic Euler method with h = 0.1 and with the exact values 
(Example 8.7, Table 8.2). 


Solution. Here /(x, y) = 2x + y, x 0 = 0, and y 0 = 1, and we are to use h = 0.2. We 
begin by approximating the value of y at x t = x 0 + h = 0.2. A first approximation y^ ] 
to this value is found using formula (8.50). Since /(x 0 , y 0 ) = /( 0, 1) = 1.000, we have 

y[ l) = y 0 + hf(x 0 , y 0 ) = 1.000 + 0.2(1.000) = 1.200. 

We now use (8.52) to find a second approximation y^ ] to the desired value. Since 
f{x i, y[ l) ) = /(0.2, 1.200) = 1.600, we have 


. *, + * , , 000 + 1000+L600 (02) , , 260 

We next employ (8.53) to find a third approximation v’i 3) . Since /(x 1 ,y < 1 2, ) = 
/(0.2, 1.260) = 1.660, we find 

+ h , ,.000 + 1000 1 1 660 ,0.2) . 1.266. 


Proceeding in like manner, we obtain fourth and fifth approximations y ( 1 4) and y ( 1 5) , 
respectively. We find 

r r> - + /(>«■ roLLttei yfl h , Lm+ 1000 * 1666 m - ,.267 


and 


y\ i 5) = + 


+ k=im+ L000+L667 (0 2) _ , 26? 


Since the approximations y (4) and y ( 1 5) are the same to the number of decimal places 
required, we take their common value as the approximation y x to the value of the 
solution y at x x = 0.2. That is, we take 

y x = 1.267. 


(8.55) 
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We now proceed to approximate the value of y at x 2 = x t + h = 0.4. For this 
purpose we employ the formulas (8.54), using y t = 1.267. We find successively 

y ( 2 1} = yi + hf(x l9 y x ) = 1.267 + 0.2(1.667) = 1.600, 

l[*^ L ±£*l iyn H = ,, 267 + 1 667 + 2400 ,0.2) = 1.674, 


A 1 ' 

= yi 

yf ] 

= yi 

y 

= yi 


= yi 


= yi 


/(^n ^i) + f(x2,y2 y ) h = J 267 + 1-667 + 2.474 


(0.2)= 1.681, 


/(X " '■) + /(X ” h - 1.267 + 1667 1 1481 , 0 . 2 , = 1 . 682 , 


f(x u yi) + f(x 2 ,y ( 2 } ) 


h = 1.267 + 


> 2 
1.667 + 2.482 


(0.2) = 1.682. 


Since the approximations y * 2 4) and y ( 2 5) are both the same to the required number of 
decimal places, we take their common value as the approximation y 2 to the value of the 
solution y at x 2 = 0.4. That is, we take 


y 2 = 1.682. (8.56) 

We compare the results (8.55) and (8.56) with those obtained using the basic Euler 
method with h = 0.1 and with the exact values. For this purpose the various results and 
the corresponding errors are listed in Table 8.4. 

The principal advantage of the modified Euler method over the basic Euler method 
is immediately apparent from a study of Table 8.4. The modified method is much more 
accurate. At x = 0.4 the error using the modified method with h = 0.02 is 0.007. The 
corresponding error 0.083 using the basic method with = 0.1 is nearly twelve times as 
great, despite the fact that a smaller value of h was used in this case. Of course at each 
step the modified method involves more lengthy and complicated calculations than the 
basic method. We note, however, that the basic method would require many individual 
steps to give a result as accurate as that which the modified method can provide in a 
single step. 


TABLE 8.4 




Using basic Euler method 

Using modified Euler 




with h = 0.1 

with h = 0.2 



Exact value of y 

Approximation 

Approximation 



(to three decimal places) 

y n Error 

y n 

Error 

0.2 

1.264 

1.230 0.034 

1.267 

0.003 

0.4 

1.675 

1.592 0.083 

1.682 

0.007 


Exercises 

1. Consider the initial- value problem 

^ = 3x + 2 y, y(0) = 1. 
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(a) Apply the modified Euler method to approximate the values of the solution y 
at x = 0.1, 0.2, and 0.3 using h = 0.1. Obtain results to three figures after the 
decimal point. 

(b) Proceed as in part (a) using h = 0.05. 

(c) Find the exact solution of the problem and determine its values at x = 0. 1 , 0.2, 
and 0.3 (to three figures after the decimal point). 

(d) Compare the results obtained in parts (a), (b), and (c), and tabulate errors. 

2. Proceed as in Exercise 1 for the initial-value problem 


dy_ 

dx 


2x - y. 


y( 0) = 3. 


3. Consider the initial-value problem 


dy_ 

dx 


x 2 + y 2 , 


y( 0) = i. 


(a) Apply the modified Euler method to approximate the values of the solution y 
at x = 0.1, 0.2, and 0.3, using h = 0.1. Obtain results to three figures after the 
decimal point. 

(b) Apply the Euler method to approximate the values of the solution y at 
x = 0.1, 0.2, and 0.3, using h = 0.1. Obtain results to three figures after the 
decimal point. 

(c) Compare the results obtained in parts (a) and (b) and tabulate errors. 


D. The Runge-Kutta Method 

We now consider the so-called Runge-Kutta method for approximating the values of 
the solution of the initial-value problem 

j^ = f(x,y), (8.1) 

>>(xo) = yo> (8-6) 

at Xi = x 0 + h , x 2 = x x + h , and so forth. This method gives surprisingly accurate 
results without the need of using extremely small values of the interval h. We shall give 
no justification for the method but small merely list the several formulas involved and 
explain how they are used.* 

To approximate the value of the solution of the initial-value problem under 
consideration at x x = x 0 + h by the Runge-Kutta method, we proceed in the following 
way. We calculate successively the numbers k u k 2 , fc 3 , /c 4 , and K 0 defined by the 


See F. Hildebrand, Introduction to Numerical Analysis, 2nd ed. (McGraw-Hill, New York, 1974). 
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formulas 

ki = hf(x 0 ,y 0 ), 

. h k.\ 

k 2 ~ ¥i x o + 2 ’ yo + y )* 

/c 3 = / J /(xo + ^o+y), (8-57) 

k 4 = ¥( x o + h,y 0 + k 3 ), 
and 

*o = 6 i + 2 /c 2 + 2/c 3 + fc 4 ). 

Then we set 

yi = yo + (8.58) 

and take this as the approximate value of the exact solution at x t = x 0 + h. 

Having thus determined y u we proceed to approximate the value of the solution at 

*2 = *i + h in an exactly similar manner. Using x t = x 0 + h and y t as determined by 

(8.58), we calculate successively the numbers k l ,k 2 ,k 3 , /c 4 , and defined by 

k i = ¥(xi,yi), 

k 2 = ¥(*1 + ^,yi+ y), 

¥ = ¥^ x i + y yi + y), (8-59) 

K = ¥( X 1 + h,y t + k 3 ), 

and 

K-i = 6 (k i + 2 /c 2 -h 2 /c 3 + k 4 ). 

Then we set 

y 2 = yi + K t (8.60) 

and take this as the approximate value of the exact solution at x 2 = x x + h. 

We proceed to approximate the value of the solution at x 3 = x 2 + h , x 4 = x 3 + h , 
and so forth, in an exactly similar manner. Letting y n denote the approximate value 
obtained for the solution at x„ = x 0 + nh , we calculate successively k t ,k 2 ,k 3 , fc 4 , and 
K n defined by 

K = h f(x n ,y„), 

¥ = ¥^ x n + ^,y n + y)> 

k 3 = ¥^ x n + 2> y» + y^> 

^4 = ¥( x n + h,y n + ¥), 



8.4 NUMERICAL METHODS 405 


and 

= i(^i + 2 /c 2 + 2k 3 + k 4 ). 

Then we set 


y , ,+ 1 = y n + K n 

and take this as the approximate value of the exact solution at x„ + 1 = x„ + h. 

► Example 8.9 

Apply the Runge-Kutta method to the initial-value problem 

dv 

^ = 2x + y, (8.2) 

^(0) = I- (8.44) 

Employ the method to approximate the value of the solution y at x = 0.2 and x = 0.4 
using h = 0.2. Carry the intermediate calculations in each step to five figures after the 
decimal point, and round off the final results of each step to four such places. Compare 
with the exact value. 

Solution. Here /(x, y) = 2x + y, x 0 = 0, y 0 = 1, and we are to use h = 0.2. Using 

these quantities we calculate successively k u k 2 , k 3 , fc 4 , and K 0 defined by (8.57). We 

first find 


k x = hf(x 0 , y 0 ) = 0.2/(0, 1) = 0.2(1) = 0.20000. 


Then since 


+ 5 = 0 + 1 ( 0 . 2 ) = 0.1 


and 


we find 


Next, since 


we find 


k 1 

y 0 + Y = 1 00000 + 2 (°- 2000 °) = 1-1°°00, 

k 2 = hf(x 0 +^,y 0 + y) = °.2/(0.1, 1.10000) 

= 0.2(1.30000) = 0.26000. 

y 0 +Y = 1 00000 + \ (0-26000) = 1.13000, 
k 3 = hf(x o + Y y* + y) = 0.2/(0.1,-1.13000) 


= 0.2(1.33000) = 0.26600. 
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Since x 0 + h = 0.2 and y 0 + k 3 .= 1.00000 + 0.26600 = 1.26600, we obtain 

k 4 = hf(x 0 + h,y 0 + k 3 ) = 0.2/(0.2, 1.26600) 

= 0.2(1.66600) = 0.33320. 

Finally, we find 

K 0 = £(/c, + lk 2 + 2 k 3 + k 4 ) = ^(0.20000 + 0.52000 + 0.53200 + 0.33320) 

= 0.26420. 

Then by (8.58) the approximate value of the solution at x, = 0.2 is 

y 1 = 1 + 0.2642 = 1.2642. (8.61) 

Now using y ( as given by (8.61), we calculate successively k t , k 2 , k 3 , k 4 , and 
defined by (8.59). We first find 

k l = hf(x u y.) = 0.2/(0.2, 1.2642) = 0.2(1.6642) = 0.33284. 

Then since 

Xl + ^ = 0.2 + ^ (0.2) = 0.3 
and 

y i +~= 1.26420 + X - (0.33284) = 1.43062, 

we find 

k 2 = hf(x, + y3'i+y) = 0.2/(0.3, 1.43062) 

= 0.2(2.03062) = 0.40612. 

Next, since 

yi+Y= 1.26420 + ^(0.40612) = 1.46726, 

we find 

k 3 = hf(x t + y Fi + y) = 0- 2 /(0.3, 1 .46726) 

= 0.2(2.06726) = 0.41345. 

Since x t + h = 0.4 and y t + k 3 = 1.26420 + 0.41345 = 1.67765, we obtain 

k 4 = hf (x | +h, yi + k 3 ) = 0.2/(0.4, 1.67765) 

= 0.2(2.47765) = 0.49553. 

Finally, we find 

= i(ki + 2fc 2 + 2k 3 + fc 4 ) 

= £(0.33284 + 0.81224 + 0.82690 + 0.49553) = 0.41125. 

We round off K t according to the well-known rule of rounding off so as to leave the 
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last digit retained an even one. Then by (8.60) we see that the approximate value of the 
solution at x 2 = 0.4 is 

y 2 = 1.2642 + 0.41 12 = 1.6754. (8.62) 

As before, the exact values are determined using (8.46). Rounded off to four places 
after the decimal point, the exact values at x = 0.2 and x = 0.4 are 1.2642 and 1.6754, 
respectively. The approximate value at x = 0.2 as given by (8.61) is therefore correct to 
four places after the decimal point, and the approximate value at x = 0.4 as given by 
(8.62) is likewise correct to four places! 

The remarkable accuracy of the Runge-Kutta method in this problem is certainly 
apparent. In fact, if we employ the method to approximate the solution at x = 0.4 using 
h = 0.4 (that is, in only one step); we obtain the value 1.6752, which differs from the 
exact value 1.6754 by merely 0.0002. 


Exercises 


1. Consider the initial-value problem 

%- ix+iy - 

y(0) = 1. 

(a) Apply the Runge-Kutta method to approximate the values of the solution y 
at x = 0.1, 0.2, and 0.3, using h = 0.1. Carry the intermediate calculations in 
each step to five figures after the decimal point, and round off the final results 
of each step to four such places. 

(b) Find the exact solution of the problem and compare the results obtained in 
part (a) with the exact values. 

2. Proceed as in Exercise 1 for the initial-value problem 

dy . 

Tx- 2x ~ y - 

y(0) = 3. 

3. Proceed as in part (a) of Exercise 1 for initial-value problem 

e-’-* 

y( 0) = i. 


E. The Milne Method 

The Euler method, the modified Euler method, and the Runge-Kutta method are all 
starting methods for the numerical solution of an initial-value problem. As we have 
already pointed out, a starting method is a method that can be used to start the 
solution. In contrast, a continuing method is one that cannot be used to start the 
solution but that can be used to continue it, once it is sufficiently well started. In this 
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section we consider briefly a useful continuing method, the so-called Milne method. We 
shall give no justification for this method but shall merely list the formulas involved and 
indicate how they are employed.* 

The Milne method can be used to approximate the value of the solution of the initial- 
value problem 

% = fix, y), ( 8 . 1 ) 


y{x 0 ) = yo ( 8 . 6 ) 

at x„ +1 = x 0 + (n 4* 1 )h, provided the values at the four previous points x„_ 3 , x„_ 2 , 
x n _ ! , and x„ have been determined. We assume that these four previous values have 
been found and denote them by y„_ 3 , y„_ 2 , y n -u an d y n > respectively. Then we can 
use ( 8 . 1 ) to determine dy/dx at x„_ 2 , x„_ l5 and x„. That is, we can determine 
y'n-2 = f(x„-2> y«-i), y'n-1 = fiXn-u - 1 ), and = f(x n , y„). These various num- 
bers being determined, the Milne method proceeds as follows: 

We first determine the number y^ t given by the formula 

4 h 

y!i+i = y*-3 + y (2 y'„ -y'„-i + 2 y' n - 2 ). (8.63) 


Having thus determined we next determine the number y'„ ( +\ given by 

/n+i = f(x n + i,ynh)- (8.64) 

Finally, having determined yJ/Vi, we proceed to determine the number yfl , given by 


y ( n 2 h = y n -i + 3(^+1 + 4 y'„ + y'-J. 


(8.65) 


If the numbers , determined from (8.63) and y { n f , determined from (8.65) are the 
same to the number of decimal places required, then we take this common value to be 
the approximate value of the solution at x„ + 1 and denote it by y n+l . 

If the numbers x and y ( n 2 l L so determined do not agree to the number of decimal 
places required and all of the calculations have been checked and appear to be correct, 
then we proceed in the following way. We calculate the number 


E = 


y(2) _ yd) 

y n+ 1 y n + 1 

29 


which is the principal part of the error in the formula (8.65). If E is negligible with 
respect to the number of decimal places required, then we take the number y[ 2 l x given 
by (8.65) as the approximate value of the solution at x„ + 1 and denote it by y n + 1 . On the 
other hand, if E is so large that it is not negligible with respect to the number of decimal 
places required, then the value of h employed is too large and a smaller value must be 
used. 

We observe that once the values y 0 ,y l ,y 2 ,^ ( i ^3 have been determined, we can use 
the Milne formulas with n = 3 to determine y 4 . Then when y 4 has been determined by 
the formulas, we can use them with n = 4 to determine y 5 . Then proceeding in like 

manner, we can successively determine y 6 , y 7 , But we must have y 0 > y i > yi > an d ^3 i n 

order to start the Milne method. Of course y 0 is given exactly by the initial condition 


See J. Scarborough, Numerical Mathematical Analysis, 6th ed. (Johns Hopkins, Baltimore, 1966). 



8.4 NUMERICAL METHODS 409 


(8.6), and we can find y ^ , y 2 , and y 3 by one of the previously explained starting methods 
(for example, by the Runge-Kutta method). 


► Example 8.1 0 

Apply the Milne method to approximate the value at x = 0.4 of the solution y of the 
initial-value problem 

dv 

— = 2 x + y, (8.2) 

y( 0) = 1, (8.44) 

assuming that the values at 0.1, 0.2, and 0.3 are 1.1155, 1.2642, and 1.4496, respectively. 

Solution. We apply the formulas (8.63), (8.64), and (8.65) with n = 3. We set 


o' 

II 

o 

X 

y 0 = 1.0000, 

x t = 0.1, 

II 

x 2 = 0.2, 

y 2 = 1.2642, 

x 3 = 0.3, 

y 3 = 1-4496, 

and x 4 = 0.4. Then using f(x, y) = 2x + y, 

we find 


yi =/(*!, yi) = /(0.1, 1.1155)= 1.3155, 
y '2 = f(x 2 , y 2 ) = m2, 1.2642) = 1.6642, 

? 3 = fix 3 , y 3 ) = /( 0.3, 1.4496) = 2.0496. 

We now use (8.63) with n = 3 and h = 0.1 to determine y^K We have 

y* l) = .Vo + (2 y' 3 - y' 2 + 2y\) 

= 1.000 + ^(4.0992 - 1.6642 + 2.6310) = 1.6755. 

Having thus determined we use (8.64) with n = 3 to determine We find 

y'l l) = f(xt, yi l) ) = /(0.4, 1.6755) = 2.4755. 

Finally, having determined we use (8.65) with n = 3 and h = 0.1 to determine y { ^\ 
We obtain 

y 4 2) = y 2 + y- (y’J 1 ' + 4/ 3 + y 2 ) 

= 1.2642 + ~ (2.4755 + 8.1984 + 1.6642) 

= 1.6755. 

Since the numbers and y^ 2) agree to four decimal places, we take their common 
value as the approximate value of the solution at x 4 = 0.4 and denote it by y 4 . That is, 
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we set 

y 4 = 1.6755. 

Using (8.46), the exact value at x = 0.4, rounded off to four places after the decimal 
point, is found to be 1.6754. 

Exercises 

1. Apply the Milne method to approximate the value at x = 0.4 of the solution y of 
the initial-value problem 

j- = 3x + 2 y, 
ax 

y( 0) = i, 

assuming that the values at 0.1, 0.2, and 0.3 are 1.2375, 1.5607, and 1.9887, 
respectively. 

2. Apply the Milne method to approximate the value at x = 0.4 of the solution y of 
the initial-value problem 


y( 0) = 3, 

assuming that the values at 0.1, 0.2, and 0.3 are 2.7242, 2.4937, and 2.3041, 
respectively. 



CHAPTER NINE 

The Laplace Transform 


In this chapter we shall introduce a concept that is especially useful in the solution 
of initial-value problems. This concept is the so-called Laplace transform, which 
transforms a suitable function / of a real variable t into a related function F of a real 
variable 5. When this transform is applied in connection with an initial-value problem 
involving a linear differential equation in an “unknown” function of t , it transforms the 
given initial-value problem into an algebraic problem involving the variable s. In 
Section 9.3 we shall indicate just how this transformation is accomplished and how the 
resulting algebraic problem is then employed to find the solution of the given initial- 
value problem. First, however, in Section 9.1 we shall introduce the Laplace transform 
itself and develop certain of its most basic and useful properties. 


9.1 DEFINITION, EXISTENCE, AND BASIC PROPERTIES OF THE 
LAPLACE TRANSFORM 

A. Definition and Existence 

DEFINITION 

Let f be a real-valued function of the real variable t , defined for t > 0. Let shea variable 
that we shall assume to be real , and consider the function F defined by 

F(s) = j“e~*f(t)dt, (9.1) 

for all values of s for which this integral exists. The function F defined by the integral 
(9.1) is called the Laplace transform of the function f. We shall denote the Laplace 
transform F of f by if {/} and shall denote F(s) by if {/(t)}. 
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In order to be certain that the integral (9. 1 ) does exist for some range of values of s, we 
must impose suitable restrictions upon the function / under consideration. We shall do 
this shortly; however, first we shall directly determine the Laplace transforms of a few 
simple functions. 


► Example 9.1 


Consider the function / defined by 

f(t) = 1, for t > 0. 

Then 


r oo rR r_e-sn* 

i?{l}= e~ st • 1 dt = lim • 1 ^ = lim 

Jo R->co Jo R~> a> L 5 JO 

R-* o o S J S 


for all s > 0. Thus we have 


( s > 0 ). 


► Example 9.2 


Consider the function / defined by 

f(t) = t , for t > 0. 

Then 


f 00 C R f e~ st 

e~ st 'tdt= lim e~ sl -tdt= lim ^-(st + 1) 

Jo R-> 00 Jo R-^OO [_ S 


= lim 

R-* oo 

for all s > 0. Thus 


"1 e~ sR 
_s 2 s 2 


(sR + 1) 


= - (s > 0). 


(9.2) 


(9.3) 


► Example 9.3 

Consider the function / defined by 

f(t) = e“', for t > 0. 


if {e m } = (* e s 'e m dt = lim j e (a s)t dt = lim — 
Jo R^co Jo R^ao [_& ~ 

r e {a ~ s)R i i_ i i 

[_a — s a — s J a — s s — a 


= lim 

R-* oo I 


for all s > a. 
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Thus 


$£{e at } = 


s — a 


(s > a). 


(9.4) 


► Example 9.4 


Consider the function / defined by 

f(t) = sin bt for t > 0. 


if {sin bt} = 


e st • sin bt dt = lim 

o R~* oo 


sin bt dt 


= lim — 


R-* oo 


s 2 T* b J 


( s sin bt + b cos bt) 


= lim 

R-*ao 


s 2 + b 2 s 2 + b 2 


(s sin bR + b cos bR) 


Thus 


s 2 + b 2 


for all s > 0. 


i?{sin bt} = 


s 2 + b 2 


(s > 0). 


(9.5) 


► Example 9.5 


Consider the function / defined by 

f(t) = cos bt for t > 0. 


if {cos 


bt} = j* c“ s '- 

= lim I 

R oo L 

= lim I 

R^ oo l_ 


CR 


cos bt dt = lim 

R-> oo 


e st cos bt dt 


s 2 + b 2 

-sR 


( — 5 cos bt + b sin bt) 


€ S 

( — 5 cos bR + b sin bR) + 


s 2 + b 


s 2 -b b 2 

for all 5 > 0. 


s 2 + b 2 


Thus 


if {cos bt} = (s > 0) - (9 ' 6) 

In each of the above examples we have seen directly that the integral (9.1) actually does 
exist for some range of values of 5. We shall now determine a class of functions / for 
which this is always the case. To do so we first consider certain properties of functions. 
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DEFINITION 

A function f is said to be piecewise continuous (or sectionally continuous) on a finite 
interval a < t <bif this interval can be divided into a finite number of subintervals such 
that (1) f is continuous in the interior of each of these subintervals , and (2) f(t) ap- 
proaches finite limits as t approaches either endpoint of each of the subintervals from its 
interior. 

Suppose / is piecewise continuous on a < t < fe, and t 0 , a < t 0 < b, is an endpoint 
of one of the subintervals of the above definition. Then the finite limit approached by 
f(t) as t approaches t 0 from the left (that is, through smaller values of t) is called the 
left-hand limit of f(t) as t approaches t 0 , denoted by lim r ^ ro _ f(t) or by f(t 0 — ). 
In like manner, the finite limit approached by f(t) as t approaches t 0 from the right 
(through larger values) is called the right-hand limit of f(t) as t approaches t 0 , 
denoted by lim,_>, 0 + f(t) or f(t 0 + ). We emphasize that at such a point r 0 , both 
f(t 0 —) and f(t 0 +) are finite but they are not in general equal. 

We point out that if / is continuous on a < t < b it is necessarily piecewise 
continuous on this interval. Also, we note that if / is piecewise continuous on 
a < t < b, then / is integrable on a < t < b. 




Figure 9.1 
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DEFINITION 

A function f is said to be of exponential order if there exists a constant a and positive 
constants t 0 and M such that 

e~ at \f{t)\ < M (9.7) 

for all t > t 0 at which f(t) is defined. More explicitly , if f is of exponential order 
corresponding to some definite constant a in (9.7), then we say that f is of exponential 
order e at . 

In other words, we say that / is of exponential order if a constant a exists such that 
the product e~ at \f(t)\ is bounded for all sufficiently large values of t. From (9.7) we 
have 

\m\<Me« (9.8) 

for all t > t 0 at which f(t) is defined. Thus if / is of exponential order and the values 
f(t) of / become infinite as t -+ oo, these values cannot become infinite more rapidly 
than a multiple M of the corresponding values e at of some exponential function. We 
note that if / is of exponential order e at , then / is also of exponential order e fit for any 
f > a. 

► Example 9.7 

Every bounded function is of exponential order, with the constant a = 0. Thus, for 
example, sin bt and cos bt are of exponential order. 

► Example 9.8 

The function / such that f(t) = e at sin bt is of exponential order, with the constant 
a = a. For we then have 


e -*\f(t)\ = e- m e*\sin bt | = |sin bt |, 
which is bounded for all t. 

► Example 9.9 

Consider the function / such that f(t) = t n , where n > 0. Then e~ at \ f(t)\ is e~ at t n . For 
any a > 0, lim,^ e~ at t n = 0. Thus there exists M > 0 and t 0 > 0 such that 

e~ at \f(t)\ =e~ at t n <M 

for t > t 0 . Hence f(t) = t n is of exponential order, with the constant a equal to any 
positive number. 

► Example 9.10 

The function / such that f{t) = e t2 is not of exponential order, for in this case 
e~ at \f{t)\ is e t2 ~ at and this becomes unbounded as t -+ oo, no matter what is the value 
of a. 
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We shall now proceed to obtain a theorem giving conditions on / that are sufficient 
for the integral (9. 1) to exist. To obtain the desired result we shall need the following two 
theorems from advanced calculus, which we state without proof. 

THEOREM A Comparison Test for Improper Integrals 
Hypothesis 

1. Let g and G be real functions such that 

0 < g(t) < G(t) on a < t < oo. 

2. Suppose G(t) dt exists. 

3. Suppose g is integrable on every finite closed subinterval of a < t < oo. 
Conclusion. Then J® g(t) dt exists. 

THEOREM B 
Hypothesis 

1. Suppose the real function g is integrable on every finite closed subinterval of 
a < t < oo. 

2. Suppose \g(t)\ dt exists. 

Conclusion. Then g(t) dt exists. 

We now state and prove an existence theorem for Laplace transforms. 
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for t > t 0 . Also 


Thus 



Me~ (s ~ a)t dt 



*oo 

M e ~ (s ~ a)t dt exists for s > a. 

to 


Finally, by Hypothesis 1, e~ st \f(t)\ is integrable on every finite closed subinterval of 
t 0 < t < oo. Thus, applying Theorem A withg(r) = e~ st \f(t )\ and G(t) = Me _(s_a) ',we 
see that 


In other words, 



*“*1/(01* 


\e~ st m\dt 


exists if s > a. 


exists if s > a, 


and so by Theorem B 



e st f(t) dt 


also exists if s > a. Thus the Laplace transform of / exists for s > a. 


Q.E.D. 


Let us look back at this proof for a moment. Actually we showed that if / satisfies the 
hypotheses stated, then 

*oo 

*” st l/(OI* exists if s > a. 

J to 

Further, Hypothesis 1 shows that 

m t 0 

e~ st \f(t)\ dt exists. 

Jo 


Thus 


£ 


e st \f{t)\ dt exists if s > a. 


In other words, if / satisfies the hypotheses of Theorem 9.1, then not only does $£ {/} 
exist for s > a, but also if {| / 1} exists for 5 > a. That is, 


£ 


e st f (t) dt is absolutely convergent for s > a. 
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We point out that the conditions on / described in the hypothesis of Theorem 9. 1 are 
not necessary for the existence of if {/}. In other words, there exist functions / that do 
not satisfy the hypotheses of Theorem 9.1, but for which if {/} exists. For instance, 
suppose we replace Hypothesis 1 by the following less restrictive condition. Let us 
suppose that / is piecewise continuous in every finite closed interval a < t < h, where 
a > 0, and is such that | t n f(t)\ remains bounded as t -+ 0 + for some number n , where 
0 < n < 1. Then, provided Hypothesis 2 remains satisfied, it can be shown that if {/} 
still exists. Thus for example, if f(t) = t ~ 1/3 , t > 0, if {/} exists. For although / does 
not satisfy the hypothesis of Theorem 9.1 [f(t) -+ oo as t -+ 0 + ], it does satisfy the less 
restrictive requirement stated above (take n = §), and / is of exponential order. 


Exercises 


In each of Exercises 1-6, use the definition of the Laplace transform to find if {/(t)} 
for the given f(t). 

1. f(t) = t 2 . 


2. / ( t ) = sinh t. 

f5, Oct <2, 

[ 0 , 


4. 


3. f(t) = 
f(t) = 

5. f(t) = 

6 . m = 


t > 2. 

4, 0 < t < 3, 

2, r > 3. 

t, 0 < f < 2, 

3, t > 2. 

0, 0 < r < 1, 

t, 1 < t < 2, 

1, t>2. 


B. Basic Properties of the Laplace Transform 
THEOREM 9.2 The Linear Property 

Let / x and f 2 be functions whose Laplace transforms exist , and let c x and c 2 be constants. 
Then 

+ c 2 f 2 (t)} = C^{/ ( (I)} + c 2 ^{/ 2 (0}. (9.9) 

Proof. This follows directly from the definition (9.1). 

► Example 9.1 1 

Use Theorem 9.2 to find if {sin 2 at}. Since sin 2 at = (1 — cos 2at)/2 , we have 

if {sin 2 at} = if {^ - j cos 2 at}. 
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By Theorem 9.2, 


By (9.2), if { 1 } = 


1 /s, and 
if {sin 2 


— j cos 2 at} = {1} — | if {cos 2 at}. 

by (9.6), if {cos 2 at} = s/(s 2 + 4a 2 ). Thus 

.111 s _ 2a 2 

at 2 s 2 s 2 + 4a 2 s(s 2 + 4a 2 )* 


(9.10) 


THEOREM 9.3 
Hypothesis 

1. Let f be a real function that is continuous for t > 0 and of exponential order e at . 

2. Let f' ( the derivative of f) be piecewise continuous in every finite closed interval 
0 < t < b. 


Conclusion. Then if {/'} exists for s > a; and 

if{/'(r)} = sif{/(0}-/(0). 


(9.11) 


Proof. By definition of the Laplace transform, 

W(0}= lim [ R e~ s f ft) dt, 

R -+00 Jo 

provided this limit exists. In any closed interval 0 < t < R, f\t) has at most a finite 
number of discontinuities; denote these by t l9 t 2i . . . , t„, where 

0 < t x < t 2 < < t n < R. 


Then we may write 

*r fti ft 2 r-R 

e~ st f\t) dt = e~ st f\t) dt + e~ st f\t) dt + ••• + e- st f'(t)dt. 

% 0 Jo J tj J tn 

Now the integrand of each of the integrals on the right is continuous. We may therefore 
integrate each by parts. Doing so, we obtain 



By Hypothesis 1, / is continuous for t > 0. Thus 


/(*!-) = f(h+)> fit 2~) = fill +),•••, /(«„-) = f(tn + )• 

Thus all of the integrated “pieces” add out, except for e -s '/(r)| l=0 and e~ s ‘f(t)\ t=R _ , 
and there remains only 

f " e~T(t) dt = -/(0) + e~ sR f(R) + s e~J(t) dt. 

Jo Jo 

But by Hypothesis 1 / is of exponential order e M . Thus there exists M > 0 and t 0 > 0 
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such that e~ at \f(t)\ < M for t > t 0 . Thus \e~ sR f(R)\ < Me~ {s ~ a)R for R > t 0 . Thus if 
s > a, 


Further, 


Thus, we have 


lim e~ sR f(R) = 0. 

R-> oo 


lim s f e s, f(t)dt = sSe{f{t)}. 
R-* °o Jo 


lim f e s, f'(t) dt = — /(0) + s&{f(t)}, 
R~> oo Jo 

and so &{f'(t)} exists for s > a and is given by (9.1 1). 


Q.E.D 


► Example 9.1 2 

Consider the function defined by f(t) = sin 2 at. This function satisfies the hypotheses 
of Theorem 9.3. Since f'(t) = 2 a sin at cos at and /(0) = 0, Equation (9.1 1) gives 


if {2 a sin at cos at} = sif {sin 2 at}. 


By (9.10), 


Thus, 


if {sin 2 at} = 


la 2 


5(5 2 -F 4a 2 ) 


if {2a sin at cos at} = 


2a 2 


s 2 + 4a 2 * 


Since 2a sin at cos at = a sin 2at, we also have 


if {sin 2at} = 


2a 


s 2 + 4a 2 * 

Observe that this is the result (9.5), obtained in Example 9.4, with b = la. 
We now generalize Theorem 9.3 and obtain the following result: 


THEOREM 9.4 
Hypothesis 

1. Let f be a real function having a continuous (n — l)st derivative f (n ~ l) ( and hence f 

/ ( " " 2) are Also continuous ) for t > 0; and assume that / ( " “ 1 } are all of 

exponential order e at . 

2. Suppose f {n) is piecewise continuous in every finite closed interval 0 < t < b. 
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Conclusion. if {f (n) } exists for s > a and 

&{p n \t)} = s n <?{m} - s n ~ i m - s n ~ 2 m - 5 "- 3 /"( 0 ) - ••• - 

(9.12) 

Outline of Proof. One first proceeds as in the proof of Theorem 9.3 to show that 
if { f {n) } exists for 5 > a and is given by 

&{fW}=s& 0 ). 

One then completes the proof by mathematical induction. 


► Example 9.1 3 

We apply Theorem 9.4, with n = 2, to find if (sin bt}, which we have already found 
directly and given by (9.5). Clearly the function / defined by f(t) = sin bt satisfies the 
hypotheses of the theorem with a = 0. For n = 2, Equation (9.12) becomes 

= s 2 &{f(t)} - s/( 0) - /'(0). (9.13) 

We have f'(t) = b cos bt, f"(t) = —b 2 sin bt, /( 0) = 0, /'(0) = b. Substituting into 
Equation (9.13) we find 

y{ — b 2 sin bt} = s 2 F £ (sin bt} - b, 

and so 

(s 2 + b 2 )£F {sin bt} = b. 

Thus, 

& (sin bt} = bi (s > 0), 
which is the result (9.5), already found directly. 


THEOREM 9.5 Translation Property 


Hypothesis. Suppose f is such that if {/} exists for s > a. 

Conclusion. For any constant a , 

^{e a m} = F(s - a) 
for s > a + a, where F(s) denotes Jz? {/(f)}. 


Proof. F(s) = {f(t)} = e s, f{t) dt. Replacing s by s — a, we have 


F(s - a) = 


- (s~a)t 


7(0 dt 


-r 


«“*[«*/(0] dt = &{e m f(t)}. 


(9.14) 


o 


Q.E.D 
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► Example 9.14 

Find if {e at t}. We apply Theorem 9.5 with f(t) = t. 

&{e at t) = F(s - a), 

where F(s) = J = if {r}. By (9.3), if {t} = 1/s 2 (s > 0). That is, F(s ) = 1/s 2 and 
so F(s — a) = l/(s — a) 2 . Thus 

P-15) 


► Example 9.1 5 

Find F£{e M sin bt}. We let f(t) = sin bt. Then if {e m sin bt} = F(s — a), where 
F(s) = if (sin bt} = ^ ^ (s > 0). 


Thus 


and so 


F(s - a) = 


b 

(s — a) 2 + h 2 


if^ 1 " sin hr} 


b 

(s - a) 2 + b 2 


(s > a). 


(9.16) 


THEOREM 9.6 

Hypothesis. Suppose f is a function satisfying the hypotheses of Theorem 9.1, with 
Laplace transform F, where 

F(s) = j* e~ sl f(t) dt. (9.17) 

Conclusion 

2’{m)}=(-ir£ ; \_F(s)i (9.i8) 

Proof. Differentiate both sides of Equation (9.17) n times with respect to s. This 
differentiation is justified in the present case and yields 

F(s) = (- 1) 1 JV s 'r/(r)dt, 

F"(s) = (- l) 2 JVv/(r)dr, 

F (n, (s) = (-l) n | e~ s, t"f(t) dt, 
from which the conclusion (9.18) is at once apparent. 


Q.E.D 
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► Example 9.1 6 

Find if {t 2 sin bt}. By Theorem 9.6, 

if {r 2 sin bt} = ( - !) 2 C F ( S )]> 


where 


(using (9.5)). From this, 


and 


Thus, 


F(s) = if {sin bt} 


b 

s 2 + b 2 



2 bs 

(s 2 + b 2 ) 2 



6 bs 2 — 2 b 3 
(, s 2 + b 2 ) 3 ’ 


if {t 2 sin bt} 


6 bs 2 — 2 b 3 

l s 2 + b 2 ) 3 ' 


Exercises 

1. Use Theorem 9.2 to find if {cos 2 at}. 

2. Use Theorem 9.2 to find if {sin at sin bt}. 

3. Use Theorem 9.2 to find if {sin 3 at} and then employ Theorem 9.3 to obtain 
if {sin 2 at cos at}. 

4. Use Theorem 9.2 to find if {cos 3 at} and then employ Theorem 9.3 to obtain 
if {cos 2 at sin at}. 

5. If if {f 2 } = 2/s 3 , use Theorem 9.3 to find if {t 3 }. 

6. If if {t 2 } = 2/s 3 , use Theorem 9.4 to find if {t 4 }. 

7. Use (9.11) and (9.13) to find if {/(f)} if 

AO + 3/'(0 + 2/(0 = 0, /(0) = 1, and /'( 0) = 2. 

8. Use (9.11) and (9.13) to find if {/(t)} if 

no + 4/'(0 - 8/(0 = 0, /(0) = 3, /'(0) = - 1. 

9. Use Theorem 9.5 to find <£{e at t 2 }. 

10. Use Theorem 9.5 to find F£{e at sin 2 bt}. 

11. Use Theorem 9.6 to find if {f 2 cos bt}. 

12. Use Theorem 9.6 to find if [t 3 sin bt}. 
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13. Use Theorem 9.6 to find if {t 3 e at }. 

14. Use Theorem 9.6 to find if {t 4 e m }. 


C. Step Functions, Translated Functions, and Periodic Functions 


In the application of the Laplace transform to certain differential equations prob- 
lems, we shall need to find the transform of a function having one or more finite 
discontinuities. In dealing with these functions, we shall find the concept of the so- 
called unit step function to be very useful. 

For each real number a > 0, the unit step function u a is defined for nonnegative t by 


K(t) = 


(9.19) 


fO, t < a , 

^1, t > a 

(see Figure 9.2a). In particular, if a = 0, this formally becomes 

(0, t < 0, 
t > 0; 

but since we have defined u a in (9.19) only for nonnegative t, this reduces to 

u 0 (t) =1 for t > 0 

(see Figure 9.2 b). 

The function u a so defined satisfies the hypotheses of Theorem 9.1, so J?{u a (t)} 
exists. Using the definition of the Laplace transform, we find 


(9.20) 




K(t)} = e s, u c 


,(r) dt = 


e~ s '(0) dt + 


* o 

Ja 


"(1) dt 



r R Y- e ~ sr 

0 + lim 

e~ st dt = lim 

R-* oo % 

a R~>co _ S 


= lim 

R-* oo 


— e sR + e~ 


for s > 0. 


Thus we have 


-as 

&{u a {t)}= (s> 0). (9.21) 

5 

A variety of so-called step functions can be expressed as suitable linear combinations 
of the unit step function u a . Then, using Theorem 9.2 (the linear property), and 
if {u a (t)}, we can readily obtain the Laplace transform of such step functions. 




Figure 9.2 



9.1 DEFINITION, EXISTENCE, AND BASIC PROPERTIES OF THE LAPLACE TRANSFORM 425 



0 2 5 


Figure 9.3 


► Example 9.1 7 

Consider the step function defined by 

0, 0 < t < 2, 

/(f) = 3, 2 < t < 5, 

0, t > 5. 

The graph of / is shown in Figure 9.3. We may gxpress the values of / in the form 

0-0, Oct <2, 

/(t) =3-0, 2 < t < 5, 

3-3, t > 5. 

Hence we see that / is the function with values given by 

|0, 0 < t < 2, 

(3, t > 2, 

minus the function with values given by 

[0, 0 < t < 5, 

{3, t > 5. 

Thus f(t) can be expressed as the linear combination 

3 u 2 {t) - 3 u 5 (t) 

of the unit step functions u 2 and u 5 . Then using Theorem 9.2 and formula (9.21), we find 
if {/(t)} = if {3 u 2 (t) - 3u 5 (t)} = — - — = - [e~ 2 ° - <r 5s ], 

5 S S 

Another useful property of the unit step function in connection with Laplace 
transforms is concerned with the translation of a given function a given distance in the 
positive direction. Specifically, consider the function / with values f(t) defined for t > 0 
(see Figure 9.4a). Suppose we consider the new function that results from translating 
the given function / a distance of a units in the positive direction (that is, to the right) 
and then assigning the value 0 to the new function for t < a. Then this new function is 
defined by 

JO, 0 < t < a, 

\f(t -a), t> a 


(9.22) 
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0 , 0 < t < a, 

1, t > a 9 


(see Figure 9Ab). Then since the unit step function u a is defined by 

K(t) = 

we see that the function defined by (9.22) is u a {t)f(t — a). That is, 
u a (t)f(t - a) = 


0 , 0 < t < a, 

f{t - a), t> a 


(9.23) 


(note Figure 9Ab again). 

Concerning the Laplace transform of this function we have the following theorem. 



THEOREM 9.7 


Hypothesis. Suppose f is a function satisfying the hypotheses of Theorem 9.1 with 
Laplace transform F so that 

F(s) = j” e~*f(t) dt; 


and consider the translated function defined by 


- a) = 


0 , 

f(t - 4 


0 < t < a, 
t > a. 


(9.24) 


Conclusion. Then, 

that is , (9.25) 

&{u.(t)At - a)} = e~ as F(s). 
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Proof 


^{u a (t) f(t - a)} = J e s, u a (t)f(t - a) dt 

= | e~ sl ■ 0 dt + I e- s, f(t- 

J 0 Ja 

= - 


a) dt 


Letting t — a = t, we obtain 

* oo 

e~ st f(t — a) dt 
Ja 

Thus 


-r 


= e 


?-s( t+4 


a) dt. 


:) /(t) dt 


i: 


e-°'f(T)dT = e- a °2’{f(t)}. 


~ a)} = e-"Se{f(t)} = e~ as F(s). Q.E.D 

In applying Theorem 9.7 to a translated function of the form (9.24), one must be 
certain that the functional values for t > a are indeed expressed in terms of t — a. In 
general, this will not be so; and if it is not, one must first express these functional values 
for t > a so that this is so. We shall illustrate this in each of Examples 9.18 and 9.19. 


► Example 9.1 8 


Find the Laplace transform of 


gU) = 



3, 


0 < t < 5, 
t > 5. 


Before we can apply Theorem 9.7 to this translated function, we must express the 
functional values t — 3 for t > 5 in terms of t - 5, as required by (9.24). That is, we 
express t — 3 as (t — 5) + 2 and write 


0(0 = 


0, 

(t - 5) + 2, 


0 < t < 5, 
t > 5. 


This is now of the form (9.24), and we recognize it as 


u 5 (t)f(t - 5) = 


0, 

(t - 5) + 2, 


0 < t < 5, 
t > 5, 


where f(t) = t + 2, t > 0. Hence we apply Theorem 9.7 with f(t) = t + 2. Using 
Theorem 9.2 (the Linear Property) and formulas (9.2) and (9.3), we find 


F(s) = <£{t + 2} = + 2JS?{1} = ^ 
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Then by Theorem 9.7, with a = 5, we obtain 

^{u 5 (t)f(t - 5)} = e- 5 °F(s) = 

This then is the Laplace transform of the given function g(t). 


► Example 9.1 9 

Find the Laplace transform of 


g( t) = 


0, 


°<'< 2 ' 


sin t, t > 


Before we can apply Theorem 9.7, we must express sin t in terms of t — tt/ 2, as required 
by (9.24). We observe that sin t = cos (t — n/2) for all t, and hence write 


9(t) = 


0 , 




n 


cos ( t 2 * I , t>j. 


This is now of the form (9.24), and we recognize it as 

/ 

0 . 

I 

««/ 2 ~ n/2) = 


0 <‘<r 


. 71 \ 71 
COSlt-jj, !>-, 

where f(t) = cost , t > 0. Hence we apply Theorem 9.7 with f(t) = cost. Using 
formula (9.6) with b = 1, we obtain 

F(s) = &{cos t} = 

Then by Theorem 9.7, with a = 7r/2, we obtain 


5 2 + 1 


2 {g(t)} = &{u. l2 (t)f(t - n/2)} = 


se 


~{n/2 )s 


s 2 + 1 


We next obtain a result concerning the Laplace transform of a periodic function. A 
function / is periodic of period P, where P > 0, if f(t + P) = f(t) for every t for which / 
is defined. For example, the functions defined by sin bt and cos bt are periodic of period 
2 n/b. 


THEOREM 9.8 

Hypothesis. Suppose f is a periodic function of period P which satisfies the 
hypotheses of Theorem 9.1. 
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Conclusion. Then 


*{/«» - 

Proof. By definition of the Laplace transform, 




e~ st m dt. 


(9.26) 


(9.27) 


The integral on the right can be broken up into the infinite series of integrals 


m p rip r3P 

e~ st f(t) dt + e~ st f(t) dt + e~ st f(t) dt + 

Jo Jp J2P 


+ 


•(» + dp 


e~ st f(t) dt*" . (9.28) 


We now transform each integral in this series. For each n = 0, 1, 2, . . . , let t = u + nP in 
the corresponding integral 

*(» + i)p 

e~ st m dt. 


r 

Jnl 


Then for each n = 0, 1, 2, ... , this becomes 


r 


e - s(u+nP) f(u + nP) du. 


(9.29) 


But by hypothesis, / is periodic of period P. Thus f(u) = f(u + P) = f(u + 2 P) = 
••• = f(u + nP) for all u for which / is defined. Also e ~ s{u + nP) = e~ su e~ nPs , where the 
factor e~ nPs is independent of the variable of integration u in (9.29). Thus for each 
n = 0, 1, 2, ... , the integral in (9.29) becomes 


- nPs 


e~ su f(u) du. 
Hence the infinite series (9.28) takes the form 


i: 


+ e 


r 

i 


e su f{u) du + e Ps | e su f(u) du 

e~ su f(u) du + ■■■ + e~ nPs J <T s "/(u) du + --- 
= [1 +e- ps + e- 2Ps 


+ ••• + «' 


r 


+ •••] e~ su f(u) du. (9.30) 


Now observe that the infinite series in brackets is a geometric series of first term 1 and 
common ratio r = e ~ Ps < 1 . Such a series converges to 1 /( 1 — r), and hence the series in 
brackets converges to 1/(1 — e~ Ps ). Therefore the right member of (9.30), and hence 
that of (9.28), reduces to 

Jo *“/(“) du 


1 -e 


Ps 
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Then, since this is the right member of (9.27), upon replacing the dummy variable u by t, 
we have 


^{/w} = 


S p 0 e~*f(t)dt 
1 -e- ps 


Q.E.D. 


► Example 9.20 


f(t) = 


Find the Laplace transform of / defined on 0 < t < 4 by 

1, 0 < t < 2, 

• 1, 2 < t < 4, 
and for all other positive t by the periodicity condition 

f(t + 4) = f(t). 

The graph of / is shown in Figure 9.5. Clearly this function / is periodic of period 
P = 4. Applying formula (9.26) of Theorem 9.8, we find 




1 


1 — e 

1 


-4s 




i - <r 4s 
l 

l -<r 4s 
1 - 2e~ 2s 


'(!)* + 
2 r,~St 


e“ s, (-l )dt 


; lt-« 


+ e 


-As 


s(l -e“ 4s ) 
— e~ 


~ 2s + 1 + - e“ 2s ] 

(1 — e~ 2s ) 2 

"s(l — e“ 2s )(l + e ~ 2s ) 


J _ s 


s(l -f e 2s ) 


Exercises 


Find if {/(t)} for each of the functions / defined in Exercises 1-18. 


1. 



0 < t < 6, 
t > 6. 


2. f(t) = 


0, 0 < t < 10, 

-3, t > 10. 


A 
1 - 

T" 

-i - 






j 


Figure 9.5 
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3- fit) 

5- f(t) 

7- fit) 

9. f(t) 

11. At) 
13. At) 
15. At) 

17. At) 


4, 

0, 

0 < t < 6, 
t > 6. 

4. 

/(«) = < 

1 

\2, 0 < t < 5, 

[0, t > 5. 

0, 

0 < t < 5, 



f 0, 0 < t < 3, 

2, 

5 < t < 7, 

6. 

At) = ■ 

-6, 3 < f < 9, 

0, 

r> 7. 



O 

V 

VO 

1, 

0 < t < 2, 



[9, 0 < t < 5, 

2, 

3, 

2 < t < 4, 

4 < t < 6, 

8. 

At) = ■ 

6, 5 < t < 10, 

3, 10 < t < 15, 

0, 

t > 6. 



0, t > 10. 

2, 

0 < t < 3, 



4, 0 < t < 5, 

0, 

3 < t < 6, 

10. 

At) = • 

0, 5 < t < 10, 

2, 

t > 6. 



3, t > 10. 

0, 

t. 

0 < t < 2, 

r> 2. 

12. 

At) = | 

\ 0, 0 < t < 4, 

[3t, t > 4. 

t, 

3, 

0 < t < 3, 
t > 3. 

14. 

At) = | 

i 2r, 0 < t < 5, 

[10, t > 5. 

0 , 

cos 

0 < t < ji/2, 
t, t > n/2. 

16. 

At) = | 

1 0, 0 < t < 2, 

1 e~‘, t > 2. 

0 , 

0 < t < 4. 



6, 0 < t < 1, 

t - 

4 4 < r < 7, 

18. 

At) = ' 

8-2 1, 1 < t < 3, 

3, 

r > 7. 



2, t > 3. 


9.2 THE INVERSE TRANSFORM AND THE CONVOLUTION 
A. The Inverse Transform 


Thus far in this chapter we have been concerned with the following problem: Given a 
function /, defined for t > 0, to find its Laplace transform, which we denoted by if {/} 
or F. Now consider the inverse problem: Given a function F, to find a function / whose 
Laplace transform is the given F. We introduce the notation if ~ 1 {F} to denote such a 
function /, denote if " 1 {F(s) } by /(f), and call such a function an inverse transform of 
F. That is, 

/(r) = if- 1 {F( s )} 

means that f(t) is such that 

^ {fit)} = F(s). 


Three questions arise at once: 


1. Given a function F, does an inverse transform of F exist? 

2. Assuming F does have an inverse transform, is this inverse transform unique? 

3. How is an inverse transform found? 
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In answer to question 1 we shall say “not necessarily,” for there exist functions F that 
are not Laplace transforms of any function /. In order for F to be a transform it must 
possess certain continuity properties and also behave suitably as s -► oo. To reassure 
the reader in a practical way we note that inverse transforms corresponding to nu- 
merous functions F have been determined and tabulated. 

Now let us consider question 2. Assuming that F is a function that does have an 
inverse transform, in what sense, if any, is this inverse transform unique? We answer 
this question in a manner that is adequate for our purposes by stating without proof the 
following theorem. 


THEOREM 9.9 

Hypothesis. Let f and g be two functions that are continuous for t > 0 and that have 
the same Laplace transform F. 

Conclusion. f(t) = g(t) for all t > 0. 

Thus if it is known that a given function F has a continuous inverse transform f then 
/ is the only continuous inverse transform of F. Let us consider the following example. 


► Example 9.21 


By Equation (9.2), { 1 } = 1 /s. Thus an inverse transform of the function F defined by 

F(s) = 1/5 is the continuous function / defined for all t by f(t) = 1. Thus by Theorem 
9.9 there is no other continuous inverse transform of the function F such that F(s) = 1 /s. 
However, discontinuous inverse transforms of this function F exist. For example, 
consider the function g defined as follows: 


Then 


g(t) = 


1 , 

2 , 


u. 


0 < t < 3 , 
t = 3 , 
t > 3 . 


^{g{t)} = J e s, g{t)dt = ^ e st dt 

T e -s(_ | 3 T e~ st 

= + lim I 

L 5 Jo 00 L 5 


+ 


R 

3 5 


dt 

if 5 > 0. 


Thus this discontinuous function g is also an inverse transform of F defined by 
F(s) = 1/s. However, we again emphasize that the only continuous inverse transform 
of F defined by F(s) = 1/s is / defined for all t by f(t) = 1. Indeed we write 



with the understanding that / defined for all t by f(t) = 1 is the unique continuous 
inverse transform of F defined by F(s) = 1/s. 
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Finally, let us consider question 3. Assuming a unique continuous inverse transform 
of F exists, how is it actually found? The direct determination of inverse transforms will 
not be considered in this book. Our primary means of finding the inverse transform of a 
given F will be to make use of a table of transforms. As already indicated, extensive 
tables of transforms have been prepared. A short table of this kind appears on page 
434. 

In using a table of transforms to find the inverse transform of a given F, certain 
preliminary manipulations often have to be performed in order to put the given F(s) in a 
form to which the various entries in the table apply. Among the various techniques 
available, the method of partial fractions is often very useful. We shall illustrate its use 
in Example 9.23. 


► Example 9.22 


Using Table 9.1, find if 1 


1 


s 2 -f 6s + 13 f ’ 


Solution. Looking in the F(s) column of Table 9.1 we would first look for F(s) = 

1 b 

. However, we find no such F(s); but we do find F(s ) = 


as 2 + bs + c 

(number 1 1). We can put the given expression 


1 


1 


1 


s 2 + 6s + 13 
1 2 


(s + a) 2 + b 2 
in this form as follows: 


s 2 + 6s+13 (s + 3) 2 + 4 2 (s + 3) 2 + 2 2 ' 

Thus, using number 1 1 of Table 9.1, we have 


jsr 


i 


s 2 + 6s + 13 


= -<e~ 1 
2 


(s + 3) + 2 A 


i 


! sin 2 1. 


► Example 9.23 


Using Table 9.1, find Ft? 1 


{s(s 2 + 1) 


Solution. No entry of this form appears in the F(s) column of Table 9.1. We employ 
the method of partial fractions. We have 

1 + Bs + C 

s(s 2 + 1) s + s 2 + 1 

and hence 


Thus 


1 = (A + B)s 2 + Cs + +. 


A + B = 0, 


C = 0, and A = 1. 
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TABLE 9.1 LAPLACE TRANSFORMS 



ST 

i 

ii 

F(s) = <£{f(t)} 

1 

i 

1 

s 

2 

e M 

1 

s - a 

3 

sin bt 

b 

s 2 +b 2 

4 

cos bt 

s 

s 2 + b 2 

5 

sinh bt 

b 

s 2 -b 2 

6 

cosh bt 

s 

s 2 -b 2 

7 

t n (n = 1,2,...) 

n\ 

s" + 1 

8 

tV'(n = 1,2,...) 

n! 

(s — fl)” + 1 

9 

t sin frr 

26s 

(s 2 + 6 2 ) 2 

10 

t cos bt 

s 2 - 6 2 
(s 2 + 6 2 ) 2 

11 

e~ at sin bt 

6 

(s + a) 2 + 6 2 

12 

e~ at cos bt 

s + a 

(s + a) 2 + 6 2 

13 

sin bt — bt cos bt 

1 

26 3 

(s 2 + 6 2 ) 2 

14 

t sin bt 

s 

2b 

(s 2 + 6 2 ) 2 

15 

K(t) 

e~ as 

s 


[see equations 
(9.19) and (9.21)] 


16 

~ «) 

[see Theorem 9.7] 

e~ as F(s) 
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From these equations, we have the partial fractions decomposition 

1 1 s 


s(s 2 + i) s s 2 + r 


Thus 


jsr 


1 


s(s 2 + 1) 


= if" - if 


s 2 + 1 


By number 1 of Table 9.1, if 1 { 1/s} = 1 and by number 4, if ^/(s 2 + 1)} = cos t. 
Thus 


£T 


1 


s(s 2 + 1) 


= 1 — cos t. 


We now give two examples of finding the inverse transform of a function that 
involves one or more terms of the form e~ as F(s). 


► Example 9.24 

Find 




Solution. By number 1 of Table 9.1, we at once have 




By number 15, we see that 


— } = u a U). 


(9.31) 


Here u a is the unit step function [see Equations (9.19) and following] defined for a > 0 
by 


ujt) = 


0, 0 < t < a, 

1, t > a, 


(9.32) 


and for a = 0 by 

u 0 (t) =1 for t > 0. 

Applying (9.31) and (9.32) with a = 3 and a = 7, respectively, we have 


if 


- i 


- 3s 


and 


J&T 


= « 3 (t) = 


U 7 (t) = 


0, 0 < t < 3, 

1, t > 3, 

0, 0 < t < 7, 

1, t > 7. 


(9.33) 


(9.34) 
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Thus we obtain 


5 3e" 3s 2e~ ls 


(S S S J 

Now using (9.33) and (9.34), we see that this equals 

5 — 0 — 0, 0 < t < 3, 
• 5- 3-0, 3 < t < 7, 
5-3-2, t > 7; 

and hence 


= 5 - 3 u 3 (t) - 2u 7 (t). 


5 3g~ 3 * 2e~ 7s 

s s s 


5, 0 < t < 3, 
= • 2, 3 < t < 7, 
0, t > 7. 


► Example 9.25 
Find 


■ 


Solution. This is of the form if l {e ‘“F(s)}, where a = 4 and F(s) = 2/s 2 + 5/s. 
By number 16 of Table 9.1, we see that 

&- l {e-° s F(s)} = u a (t)f(t-a). (9.35) 

Here u a is the unit step function defined for a > 0 by (9.32) and f(t) = SF~ l {F(s)} [see 
Theorem 9.7]. By number 1 of Table 9.1, we again find if -1 {1/s} = 1; and by 
number 7 with n = 1, we obtain if “ 1 { 1/s 2 } = t. Thus 


fit ) = Se~ l {F(s)} = ^ _1 {^ + 7 1 = 2r + 5, 

and so f{t — 4) = 2(r — 4) + 5 = 2t — 3. Then by (9.35), with a = 4, 

2?- 1 {e- 4 ’F( S )} = u 4 (t)f(t- 4); 

that is, 

f .J 2 5\] „ f0, 0 < t < 


y-'V S 71 + - = u 4 (t)[2t - 3] = 


0, 0 < t < 4, 
2t - 3, t > 4. 


Exercises 

Use Table 9.1 to find if ~ l {F(s)} for each of the functions F defined in Exercises 1-28: 
L F(S) ~?T9' 1 FU> " s J - 4' 

3 - 4 - T«- 
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5. 

7. 

9. 

11 . 

13. 

15. 

17. 

19. 

21 . 

23. 

25. 

27. 


F(s) = 

F(s) = 

m = 

F(s) = 
F(s) = 
F(s) = 
F(s) = 
F(s) = 
F(s) = 

m = 

F(s) = 

F(s) = 


s + 2 

6. 

F(s) 


s+ 10 

s 2 + 4s + 7 


s 2 + 8s + 20 ' 

1 

8. 

F(s) 


s + 1 

s 3 + 4s 2 + 3s * 


s 3 + 2s* 

S ~h 3 

10. 

F(s) 


s + 5 

(s 2 + 4) 2 


s 4 + 3s 3 + 2s 

5 

12. 

F(s) 


7 

(s + 2) 5 ’ 


(2s + l) 3 ' 

2s + 7 

14. 

F(s) 


8(s + 1) 

(s + 3 r 


(2s- l) 3 ' 

s — 2 

16. 

F(s) 


2s + 6 

s 2 + 5s + 6 


8s 2 — 2s — 3 

Is + 12 

18. 

F(s) 


5s + 8 

s 2 + 9 ’ 


s 2 + 3s — 10 

5s + 6 

20. 

F(s) 


s+ 10 

s 2 + 9 


s 2 + 2s — 8 € 

S + 8 c -(ks)I2 

22. 

F(s) 


2s + 9 

s 2 + 4s + 13 


s 2 + 4s + 13 * 

i 

£ 

1 

i 

24. 

F(s) 


1 

u> 

05 

1 

1 

00 

05 

s 2 


s 3 

1 +e~* s 

26. 

FI [s) 


2 — c " 3s 

s 2 + 4 


s 2 + 9 

2[1 + e - iKS)l2 ] 
s 2 — 2s + 5 

28. 

F(s) 

= 

4(e “ 2s - 1) 
s(s 2 + 4) 


B. The Convolution 

Another important procedure in connection with the use of tables of transforms is that 
furnished by the so-called convolution theorem which we shall state below. We first 
define the convolution of two functions / and g. 


DEFINITION 

Let f and g be two functions that are piecewise continuous on every finite closed interval 
0 < t < b and of exponential order. The function denoted by f*g and defined by 

f(t)*g(t) = ^mg(t-x)dT (9.36) 

is called the convolution of the functions f and g. 
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Let us change the variable of integration in (9.36) by means of the substitution u = 
t — t. We have 




f(z)g(t - t) dx 


i: 


*0 

Jt 


f(t - u)g(u) du 


= 9{u)f(t — u)du = g(t ) * f(t). 

0 


Thus we have shown that 


f*g = g*f (9.37) 

Suppose that both / and g are piecewise continuous on every finite closed interval 
0 < t < b and of exponential order e at . Then it can be shown that /* g is also piecewise 
continuous on every finite closed interval 0 < t < b and of exponential order e <a+€)t , 
where e is any positive number. Thus if {/*#} exists for s sufficiently large. More 
explicitly, it can be shown that if {/* g } exists for s > a. 

We now prove the following important theorem concerning <?{f*g}. 


THEOREM 9.10 


Hypothesis. Let the functions f and g be piecewise continuous on every finite closed 
interval 0 < t < b and of exponential order e at . 

Conclusion 

2’{f*g}=J?{f}&{g} (9.38) 

for s > a. 


Proof. By definition of the Laplace transform, if {/ * g} is the function defined by 


| - T) dx dt. 


The integral (9.39) may be expressed as the iterated integral 

e~ st f(x)g(t — t) dx dt. 


i: 


Further, the iterated integral (9.40) is equal to the doyble integral 


II 

R i 


e st f(x)g(t - x)dx dt, 


(9.39) 


(9.40) 


(9.41) 


where R t is the 45° wedge bounded by the lines x = 0 and t = x (see Figure 9.6). 
We now make the change of variable 


u = t — T, 

v = t, (9.42) 

to transform the double integral (9.41). The change of variables (9.42) has Jacobian 1 
and transforms the region R x in the t, t plane into the first quadrant of the u, v plane. 
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Figure 9.6 


Thus the double integral (9.41) transforms into the double integral 



S(u+V) f(v)g(u) du dv, 


Rl 


(9.43) 


where JR 2 is the quarter plane defined by u > 0, v > 0 (see Figure 9.7). 
The double integral (9.43) is equal to the iterated integral 


e S{u+V) f(v)g(u)dudv. 

o Jo 


But the iterated integral (9.44) can be expressed in the form 


e~ sv f(v) dv 


U g(u) du. 


(9.44) 


(9.45) 


But the left-hand integral in (9.45) defines Jz?{/} and the right-hand integral defines 
S£{g). Therefore the expression (9.45) is precisely & {f} {g}. 

We note that since the integrals involved are absolutely convergent for s > a, the 
operations performed are indeed legitimate for s > a. Therefore we have shown that 


*g} =&{/}& {g} for 5 > a. Q.E.D. 


"4 



Figure 9.7 
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Denoting $£ {/} by F and if { g } by G, we may write the conclusion (9.38) in the form 

<?{f(t)*g(t)} = F(s)G(s). 

Hence, we have 


JST ‘{F(s)G(s)} = f(t)*g(t) = I' f(x)g(t - x )dx, 
and using (9.37), we also have 

J?- l {F(s)G(s)}=g(t)*m= P gW(t - t) dx. 


(9.46) 


(9.47) 


Suppose we are given a function H and are required to determine if “ 1 { H(s) }. If we 
can express H(s) as a product F(s)G(s), where if ” 1 { F(s)} = f ( t ) and if ~ 1 { G(s)} = g(t) 
are known, then we can apply either (9.46) or (9.47) to determine if ~ l {H(s)}. 


► Example 9.26 


Find if 


- 1 


s(s 2 + 1) 


using the convolution and Table 9.1. 


Solution. We write l/s(s 2 + 1) as the product F(s)G(s), where F(s) = 1/s and 
G(s) = l/(s 2 + 1). By Table 9.1, number 1, f(t) = if -1 {l/s} = 1, and by number 3, 
g(t) = if _1 {l/(s 2 + 1)} = sint. Thus by (9.46), 


se- 


i 


s(s 2 + 1) 


= m*g(t) = 


1 • sin ( t — t) dx, 


and by (9.47), 


if 


1 


s(s 2 + 1) 




= g(t)*f{t)= sinr-ldr. 


The second of these two integrals is slightly more simple. Evaluating it, we have 

1 


if 


= 1 - cos t. 


(s(s 2 + 1) ' 

Observe that we obtained this result in Example 9.23 by means of partial fractions. 

Exercises 

In each of Exercises 1-6 find if ~ 1 {H(s)} using the convolution and Table 9.1. 

1 1 


1. H(s) = ~ 2 — ^ 

s T 5s "I - 6 

1 


3. H(s) = 


5. H(s) = 


s(s 2 + 9)' 

1 

s 2 (s + 3)’ 


2. H(s) = 
4. H(s) = 
6. H(s) = 


s 2 + 3s — 4" 

1 


s(s 2 + 4s + 13)' 
1 

(s + 2)(s 2 + 1)' 
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9.3 LAPLACE TRANSFORM SOLUTION OF LINEAR DIFFERENTIAL 
EQUATIONS WITH CONSTANT COEFFICIENTS 

A. The Method 

We now consider how the Laplace transform may be applied to solve the initial-value 
problem consisting of the nth-order linear differential equation with constant 
coefficients 

d n v d nl v dv 

U °^dt" + Ui dt n ~ l + + a " _1 + a " y = (9.48) 

plus the initial conditions 

y{ 0) = Co, /(0) = c t ,. . . , ‘>(0) = c„_ t . (9.49) 

.Theorem 4.1 (Chapter 4) assures us that this problem has a unique solution. 

We now take the Laplace transform of both members of Equation (9.48). By 
Theorem 9.2, we have 

+ "■ + + = *W)}- < 9 - 5 °) 

We now apply Theorem 9.4 to 



in the left member of Equation (9.50). Using the initial conditions (9.49), we have 
^ - s"~ l y(0) - s"- 2 /(0) - - - y^yO) 

= s n ^{y(t)} - c 0 s " -1 - C|S n_ 2 - ••• - c„_,, 

*{£<} = s n - 1 ^ , {y(t)} - s’-yo) - 5 "-y(o) - ••• - y n - 2 >( 0 ) 

= s n -^{y(t)} - c 0 s n - 2 - Cl s n ~ 3 -■■■- c„_ 2 , 


& {f } = s#{y(t)} - y(0) = s<?{y(t)} - c 0 . 

Thus, letting 7(s) denote if {y{t)} and B(s) denote if {b(t)}, Equation (9.50) becomes 

[a 0 s B + a^" -1 + ••• + a„_ t s + ajy(s) 

- CoEaos" -1 + «i s " 2 + •• + a„-i] 

- Ci\_a 0 s "~ 2 + a l s n ~ 3 + ••• + a„_ 2 ] 

- ••• - c„- 2 [a 0 s + a,] - = B(s). (9.51) 

Since b is a known function of t, then B , assuming it exists and can be determined, is a 
known function of s. Thus Equation (9.51) is an algebraic equation in the “unknown” 
7(s). We now solve the algebraic equation (9.51) to determine 7(s). Once 7(s) has been 
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found, we then find the unique solution 

y(t) = &~ l {Y(s)} 

of the given initial-value problem using the table of transforms. 

We summarize this procedure as follows: 

1. Take the Laplace transform of both sides of the differential equation (9.48), 
applying Theorem 9.4 and using the initial conditions (9.49) in the process, and 
equate the results to obtain the algebraic equation (9.51) in the “unknown” Y(s). 

2. Solve the algebraic equation (9.51) thus obtained to determine Y(s). 

3. Having found Y(s), employ the table of transforms to determine the solution 
y(t) = if _1 { Y(s)} of the given initial- value problem. 


B. Examples 

We shall now consider several detailed examples that will illustrate the procedure 
outlined above. 


► Example 9.27 


Solve the initial-value problem 




(9.52) 


y( 0) = 3 


(9.53) 


Step 1 . Taking the Laplace transform of both sides of the differential equation 
(9.52), we have 

J?l^-2&{y(t)} = J?{e 5 '}. (9.54) 

Using Theorem 9.4 with n = 1 (or Theorem 9.3) and denoting if {y(t)} by Y(s), we may 
express J?{dy/dt} in terms of Y(s) and y(0) as follows: 


Applying the initial condition (9.53), this becomes 


if® = sY(s) - 3. 

(dr J 


Using this, the left member of Equation (9.54) becomes sY(s) — 3 — 2 Y (s). From 
Table 9.1, number 2, $£ {e 5 '} = l/(s — 5). Thus Equation (9.54) reduces to the algebraic 
equation 


[s - 2] y(s) - 3 = — 

s — J 


(9.55) 


in the unknown T(s). 
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Step 2. We now solve Equation (9.55) for Y(s). We have 

[s - 2] Y(s) = 

S — J 

and so 


Y(s) = 


3s- 14 
(s-2)(s — 5)’ 


Step 3. We must now determine 


3£~ x 

We employ partial fractions. We have 

3s- 14 


3s — 14 
(s - 2)(s - 5) 


A B 
+ ■ 


(s - 2)(s - 5) s - 2 s - 5’ 
and so 3s — 14 = .4(s — 5) + B(s — 2). From this we find that 

A = f and B = j, 

and so 


jsr 


3s- 14 


(.(s — 2)(s — 5) J 3 

Using number 2 of Table 9.1, 




s — 2 3 


s- 5 


$£ M — f— 1 = e 2t and £P 


Is -2| I s - 5 

Thus the solution of the given initial-value problem is 




y = f e 2t + | e 5t . 


► Example 9.28 

Solve the initial-value problem 



(9.56) 

y(0) = 3, 

(9.57) 

/(0) = 6. 

(9.58) 


Step 1 . Taking the Laplace transform of both sides of the differential equation 
(9.56), we have 



-8jS?{y(t)} = ^{0}. 


(9.59) 
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Since i?{0} = 0, the right member of Equation (9.59) is simply 0. Denote $£ {y(0} by 
Y(:). Then, applying Theorem 9.4, we have the following expressions for $£ {d 2 y/dt 2 } 
and $£ {dy/dt} in terms of Y(s), y(0), and j/'(0): 

- s’l'i'i - - m 

^-sYM-AO). 

Applying the initial conditions (9.57) and (9.58) to these expressions, they become: 

^{§} = s2y(s)_3s_6 ’ 

Now, using these expressions, Equation (9.59) becomes 

s 2 T(s) - 3s - 6 - 2sT(s) + 6 - 8Y(s) = 0 
or 

[s 2 - 2s - 8] Y(s) - 3s = 0. (9.60) 

Step 2. We now solve Equation (9.60) for Y (>-). We have at once 


Y(s) = 


Step 3. We must now determine 


JST 


3s 


(s - 4)(s + 2)' 


3s 


((s - 4)(s + 2) 
We shall again employ partial fractions. From 


3s 


A B 
+ 


(s — 4)(s + 2) s — 4 s + 2 
we find that A = 2, B = 1. Thus 


<£- 


3s 


l(s - 4 )(s + 2) 
By Table 9.1, number 2, we find 

1 


= l<£~ 


1 


s — 4 


+ j sr 1 


s + 2 


jsr 


s — 4 


= e 4 ' and if 1 


1 


s + 2 


= e 


Thus the solution of the given initial-value problem is 

y = 2e 4 ' + e~ 2 ‘. 
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► Example 9.29 

Solve the initial-value problem 


+ y = e 21 sin t, (9.61) 

at 

y( 0) = 0, (9.62) 

y'(0) = 0. (9.63) 

Step 1. Taking the Laplace transform of both sides of the differential equation 
(9.61), we have 


d 2 y 


^^} + ^{y(t)}=^{e- 2 t smt}. 


(9.64) 


Denoting S£ (y(r)} by T(s) and applying Theorem 9.4, we express J 2 ?{d 2 y/df 2 } in terms 
of Y(s), y(0). and y'(0) as follows: 


JS? 


d 2 y 

1? 


= s 2 T(s) - sy(0) - y'(0). 


Applying the initial conditions (9.62) and (9.63) to this expression, it becomes simply 


<£ 



s 2 T(s); 


and thus the left member of Equation (9.64) becomes s 2 7(s) + 7(s). By number 11, 
Table 9.1, the right member of Equation (9.64) becomes 


1 

(s + 2) 2 + r 


Thus Equation (9.64) reduces to the algebraic equation 


(s 2 + l)T(s) = 


1 

(s + 2) 2 + 1 


(9.65) 


in the unknown Y (s). 

Step 2. Solving Equation (9.65) for T(s), we have 

7(S) = (s 2 + l)[(s + 2) 2 +Tf 
Step 3. We must now determine 

i 1 

l(s 2 + l)[(s + 2) 2 + 1] J • 


We may use either partial fractions or the convolution. We shall illustrate both 
methods. 
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1 . Use of Partial Fractions. We have 

1 As 4" B Cs 4~ D 

(s 2 4- l)(s 2 4- 4s 4- 5) s 2 4- 1 + s 2 4- 4s 4- 5 ' 

From this we find 

1 = (As 4- B)(s 2 4- 4s 4- 5) 4- (Cs 4- D)(s 2 4- 1) 

= (A + C)s 3 + (4A + B + D)s 2 + ( 5A + 4B + C)s 4- (SB + D). 
Thus we obtain the equations 

A 4- C = 0, 

4 A 4- B + D = 0, 

5A + 4B + C = 0, 

5B + D = 1. 

From these equations we find that 

A = -l B = i C = i D = i 

and so 



s 2 4- 4s 4- 5 


+ 7;^- 


s 2 4- 4s 4- 5 
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Now using Table 9.1, numbers 4, 3, 12, and 1 1, respectively, we obtain the solution 
y(t) = — | cos t + £ sin t + %e~ 2 ‘ cos t + je~ 2t sin t 


1 e 21 

y(t) = - (sin t - cos t ) + — — (sin t + cos f). 
8 8 


2. Use of the Convolution. We write — j — — : — as the product 

(s 2 + l)[(s + 2) 2 + 1] F 

F(s)G(s), where F(s) = -J - and G(s) = T -. By Table 9.1, number 3, 

s 2 + 1 (s + 2) 2 + 1 

/(f) = ^~ J-J— } = sinr, and by number 11, g(t ) = & ~ 1 {■ 1 } = 


5 2 -f 1 


(s + 2) 2 + 1 


e 2 ‘ sin t. Thus by Theorem 9.10 using (9.46) or (9.47), we have, respectively, 


(5 2 + 1 )[(s + 2) 2 + 1] 


= /(O*0(O = sini-e 2( * T) sin(r-i)dT 


‘ {(s 2 + l)[(s + 2) 2 + 1] J = 0(t) * /(0 = J/ _2t sinT • sin ( f - *> *■ 

The second of these integrals is slightly more simple; it reduces to 

r t rt 

(sin t) e~ 2z sin z cos t dz - (cos £) e _2t sin 2 t dz. 

Jo Jo 

Introducing double-angle formulas this becomes 

sin f r _ 7r , . cos t _ 7r . cos r ~ ^ , 

— T— e 2t sin 2i dz — e 2 dz + — — e 2t cos 2 t dz. 

2 Jo 2 Jo 2 Jo 

Carrying out the indicated integrations we find that this becomes 

[ €~ 2t * cos t r n* €~ 2t 1 

— — (sin 2 t + cos 2t) H — — e~ 2x + cos t — — (sin 2x — cos 2 t) 

° Jo 4 [_ Jo l_ 8 J 0 

e~ 2t . . . _ . _ x sin t e~ 2t cos t cost 

= — (sin t sin 2 1 + sin t cos 2 1) H 1 

8 8 4 4 

— 2t 

& ^ 1 COS t 

H — — (cos t sin It — cos t cos It) H — — . 

8 8 

Using double-angle formulas and simplifying, this reduces to 

1 , . e~ 2t . 

- (sin t - cos t) H — — (sin t + cos t), 

8 8 


which is the solution (9.68) obtained above using partial fractions. 
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► Example 9.30 


Solve the initial-value problem 

3 y 


i i 3 


d 2 ' 


3+4 -J~2 + 5 ~T~ + — 10 cos t, 

dt 3 dt 2 dt 


y(0) = o, 
y'( 0 ) = o, 
y"(0) = 3. 


(9.69) 

(9.70) 

(9.71) 

(9.72) 


Step 1 . Taking the Laplace transform of both sides of the differential equation 
(9.69), we have 


d 3 y 


d 2 y 


dv 


dy 


2\-^\ + + + 2 2{y(t)} = ^if {cos t}. 


dt 


(9.73) 


We denote if {y(f)} by T(s) and then apply Theorem 9.4 to express 


d 3 y 


dt 3 


d 2 y 


* -nr , 2 , and if ^ 


dt 2 


dy 


dt 


in terms of Y (s), y(0), y'(0), and y"(0). We thus obtain 
if 

(d 2 y 




if 


dv 


s 3 7(s) - s 2 y(0) - sy'(0) - y"(0), 
= s 2 Y(s) - sy(0) - y'(0), 


if )Tf = s ^( s ) — y(0). 


Applying the initial conditions (9.70), (9.71), and (9.72), these expressions become 


if 


dv 


= s 3 Y(s)- 3, 




if ^ =sy ( s). 


Thus the left member of Equation (9.73) becomes 

s 3 7(s) - 3 + 4s 2 7(s) + 5s 7(5) + 2 Y(s) 


or 

By number 4, Table 9.1, 


[s 3 + 4s 2 + 5s + 2] 7(s) - 3. 


lOif {cos t } 


10s 

s 2 -f r 
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Thus Equation (9.73) reduces to the algebraic equation 

10s 

(s 3 + 4s 2 + 5s + 2) Y (s) - 3 = 

s 2 4- 1 

in the unknown 7(s). 


Step 2. We now solve Equation (9.74) for 7(s). We have 


(s 3 + 4s 2 4- 5s + 2)7(s) = 


3s 2 4~ 10s 4~ 3 
s 2 -f 1 


(9.74) 


or 


Y(s) = 


3s 2 + 10s + 3 

(s 2 + 1)(s 3 + 4s 2 + 5s + 2)‘ 


Step 3. We must now determine 


if " 1 



3s 2 4" 10s 3 

l)(s 3 + 4s 2 + 5s + 2) 


Let us not despair! We can again employ partial fractions to put the expression for 7(s) 
into a form where Table 9.1 can be used, but the work will be rather involved. We 
proceed by writing 

3s 2 + 10s + 3 3s 2 + 10s + 3 

(s 2 + l)(s 3 + 4s 2 + 5s + 2) (s 2 + l)(s + l) 2 (s + 2) 


From this we find 


A B C Ds -f E 

s + 2 + s+l + (s+l) 2 + s 2 +r 


(9.75) 


3s 2 + 10s + 3 = A(s 4- l) 2 (s 2 4- 1) 4- B(s 4- 2)(s 4- l)(s 2 4- 1) 

+ C(s 4- 2)(s 2 + 1) + (Ds + E)(s + 2)(s + l) 2 , (9.76) 


or 

3s 2 4- 10s 4- 3 = (A 4- B 4- D)s 4 4- (2/4 4- 3D 4- C 4- AD 4- E)s 3 
4- (2/4 4- 3D 4- 2C 4- 5D 4- 4 E)s 2 
4” (2/4 4” 3J 3 4" C 4" 2 D 4” 5 D)s 4" (A 4" 2J 3 4” 2 C 4” 2 E). 

From this we obtain the system of equations 

A 4” B 4" D = 0, 

2/4 + 3J3 4- C 4- 4D + E = 0, 

2/4 4- 3£ 4- 2C 4- 5D 4- 4£ = 3, (9.77) 

2/4 4- 3J3 4- C 4- 2D 4- 5£ = 10, 

A 4- 2B 4- 2C 4- 2£ = 3. 

Letting s = — 1 in Equation (9.76), we find that C = — 2; and letting s = -2 in this 
same equation results in A = — 1. Using these values for A and C we find from the 
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system (9.77) that 

B = 2, D = — 1, and E = 2. 

Substituting these values thus found for ,4, B , C, D, and £ into Equation (9.75), we see 
that 


jsr 


3s 2 + 105 + 3 


(s 2 + l)(s 3 + 4s 2 + 5s + 2) 


= -irN— y + 2ir l$ 1 


5 + 2 


s + 1 


- 2 J 2?- 1 

+ 2jsr 1 


i 


(5 + l) 2 
1 

S 2 + 1 


jsr 


s 2 + 1 


Using Table 9.1, numbers 2, 2, 8, 4, and 3, respectively, we obtain the solution 
y(t) = —e~ 2t + 2e~ l — 2 te~ x — cos t + 2sinr. 


► Example 9.31 


Solve the initial-value problem 


where 


d 2 y , t dy 
dt 2 + dt 


+ 5y = h(t). 



0 < t < n, 
t > n. 


y( 0) = o, 


/(0) = o. 


(9.78) 


(9.79) 

(9.80) 

(9.81) 


Step 1 . We take the Laplace transform of both sides of the differential equation 
(9.78) to obtain 

+ 2JS? + 5#{y(t)} = &{h(t)}. (9.82) 

Denoting £f{y(t)} by 7(s), using Theorem 9.4 as in the previous examples, and then 
applying the initial conditions (9.80) and (9.81), we see that the left member of Equa- 
tion (9.82) becomes [s 2 + 2s + 5]T(s). By the definition of the Laplace transform, 
from (9.79) we have 

f°° C n re~ s n* 1 — e~ ns 

&{h(t)}= J e~ st h(t)dt= I = 

Thus Equation (9.82) becomes 

1 - e~ ns 

[s 2 + 2s + 5]y(s) = 


s 


(9.83) 
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Step 2. We solve the algebraic equation (9.83) for 7(s) to obtain 


Y(s) = -T 2 


s(s 2 + 2s + 5) 


Step 3. We must now determine 


s(s 2 + 2s + 5) 


Let us write this as 


s(s 2 + 25 + 5) 


s(s 2 + 25 + 5)] ’ 


and apply partial fractions to determine the first of these two inverse transforms. 
Writing 

1 A Bs + C 

5 ( 5 2 + 25 + 5) 5 5 2 + 25 + 5 ’ 

we find at once that A B = —\,C= — f. Thus 

C£-' { 1 1 = 1 j^-dll _lj£>-i{ - t -1 1 

1s(s 2 + 2s + 5) j 5 Isj 5 j(s + l) 2 + 4J 




-L-t 

sj 5 )(s + l) 2 + 4 


Sf-'l 

10 j(s + l) 2 + 4 


1 1 „ 1 . . 
= - — - e cos It - — e 'sin 2 1, 


using Table 9.1, numbers 1,12, and 11, respectively. Letting 


F (s) = -r-j 

Cl 


5(5 2 + 25 + 5) 


we thus have 


We now consider 


f(t) = j-je 'cos 2 1 sin It, 


^- l {F(s)} = f(t). 


By Theorem 9.7, 


s(s 2 + 2s + 5) 


= <£- ‘{r’Tfs)}. 


<£ l {e ns F(s)} = u n (t)f{t - n), 
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where 


-n) = 


0 < t < n, 


f(t — 7C), t> 71. 


s(s 2 + 25 + 5)] 

JO, 0 < t < 71, 

Is - ie~ {t ~ n) cos 2 (t - n) - Toe~ {t ~ n) sin 2 (t - n), t > n 
JO, 0 < t < n, 

(j - cos 2 r - sin 2t, t > n. 


Thus the solution is given by 


= ^ _1 {s(s 2 + 2s + 5)J = f(t) ~ U " it)f{t ~ n) 

_ (j — ^ e~ l cos 2 f - j§ e ~' sin 2t — 0 , 0 < t < n, 

(5 — cos 2 1 — j^e" 1 sin 2 1 — j cos 2 1 + sin It, t > n, 


t[('- 


|^cos It + - sin 2tj J, 0 < t < n, 

l)cos 2 1 + ( ~~ 2 ~~ ) sin 2t , t > n. 


Exercises 


Use Laplace transforms to solve each of the initial- value problems in Exercises 1-18: 


l. ~-y = e 3 ', y(0) = 2. 




y( 0) = i, /(0) = 2 . 

d 2 y 

5 jJ + 4,- 8, 


y(0) = o, /(0) = 6. 

7. Lj - - 2y = 18e~ ( sin 3f, 

df 2 df 


2 . -f- + y = 2 sin t, y( 0 ) = — 1 . 
at 

4. 2y = n 

dt 2 dt 

y(0) = 4, y'(0) = — l. 

6 . i -pL + 2 i i + 5y- 0 , 

dr df 

>’(0) = 2, y'(0) = 4. 

8 % + 


y(0) = 0, /(0) = 3. 


y(0) = l, /( 0 ) = o. 
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9. ti + 7 t+ I0y = 4r<r 3 ', i< 

dr dt 

y(0) = 0, y'(0) = - 1. 

- 5 + 7 ~ - 3^ = 20 sin t, 

dt 3 df 2 dt 

y(0) = 0, y'(0) = 0, /'( 0)=-2. 

dt 3 dr 2 dr 

y(0) = - 1, /(0) = 0, /'(0) = — 6. 


10 , ^- 8 ^ + IS,- 9«“, 
dr dt 

y(0) = 5, /(0) = 10. 


13 U - + 2y = m 


y(0) = o, /( 0 ) = o. 

, 4 . % + 


y(0) = 0, /( 0) = 0. 

y(0) = o, /( 0) = l. 

I6 - $+«£+<»-*«>, 


y(0) = !, /(0) = — l. 

d 2 v 

!7. + 4y = *(*), 

>-( 0 ) = 2 , >'( 0 ) = 0 . 
d 2 v 

18 . -jpr + y = WO> 

>*(0) = 2, >(0) = 3. 


where h(t) = 


where h(t ) = 


where h(t ) 


where h(t) = 


where h(t) = 


where h(t) = 


2, 0 < t < 4, 
0, t > 4, 


6 , 0 < t < 2 , 

0 , t > 2 , 

1, 0 < t < 7i/2, 
0, t > 7i/2, 


3, 0 < f < 271, 

0 , t > 2n, 

— 4t + 871 , 0 < t < In , 
0 , t > ' In , 


t, 0 < t < n, 

71, t > 7T, 
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A. The Method 


We apply the Laplace transform method to find the solution of a* first-order system 

dx dy 

ai it + a2 it + “ 3X + a * y = ^ 

bi + b 2 + b 3 x + b^y = fi 2 it). 


(9.84) 
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where a i , a 2 , a 3 , a 4 , b t , b 2 , b 3 , and b 4 are constants and j8 1 and /? 2 are known functions, 
that satisfies the initial conditions 

x(0) = c i and y(0) = c 2 , (9.85) 

where c l and c 2 are constants. 

The procedure is a straightforward extension of the method outlined in Section 9.3. 
Let X(s) denote $£ {*(£)} and let Y(s) denote $£ {)>(£)}. Then proceed as follows: 

1. For each of the two equations of the system (9.84), take the Laplace transform of 
both sides of the equation, apply Theorem 9.3 and the initial conditions (9.85), and 
equate the results to obtain a linear algebraic equation in the two “unknowns” X(s) 
and Y(s). 

2. Solve the linear system of two algebraic equations in the two unknowns X(s) and 
Y(s) thus obtained in Step 1 to explicitly determine X(s) and Y(s). 

3. Having found X(s) and Y(s), employ the table of transforms to determine the 
solution x(t) = <?~ l {X(s)} and y(t) = ^~ 1 {Y(s)} of the given initial-value 
problem. 



(9.89) 
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Using Table 9.1, number 2, we find 

$£{%e 1 } = — and if {Ae 1 } = — . 

5—1 5—1 

Thus, from (9.89) and (9.90), we see that Equations (9.88) become 

sX(s) + 1 - 6AT(s) + 37(s) = -i- 

s - 1 

sY(s) - 2X(s) - y(s) = 

S — 1 

which simplify to the form 

(s — 6)X(s) + 37(s) = — — 1, 
s — 1 

-2X(s) + (s- l)7(s) — i-, 

S — 1 

or 

(5 - 6)X(s) + 3 Y(s) = ^±1, 
s — 1 

-2X(s) + (s — l)Y(s) = — 

s — 1 


(9.90) 


(9.91) 
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and 


We first find x(t). We use partial fractions and write 

-5 + 7 A B 

(s — l)(s — 4) 5 — 1 + s — 4* 

From this we find 


A = — 2 and B = 1. 


Thus 

and using Table 9.1, number 2, we obtain 

x(t)= —2e l + e At . (9.92) 

In like manner, we find y(t). Doing so, we obtain 

y(t) = -je‘ + i e 4 '. (9.93) 

The pair defined by (9.92) and (9.93) constitute the solution of the given system (9.86) 
that satisfies the given initial conditions (9.87). 


Exercises 

In each of the following exercises, use the Laplace transform to find the solution of the 
given linear system that satisfies the given initial conditions. 


£ + y= 3e 2 ', 

2. 

£- 2 ,.o. 



l + x-3,-2, 

x(0) = 2, y(0) = 0. 


x(0) = 3, y(0) = 0. 

^ - 5x + 2 y = 3e 4 ‘, 

4. 

^ - 2x - 3y = 0, 
dt 

%- 4x + ,,0, 


dy 

- + X + 2,.,. 

x^ 

o 

II 

yj 

o 

II 

p 


x(0) = - 1, y(0) = 

dx 

--t X + 2y = 2,, 

6. 

+ x + y = 5c 2( , 
dt 

ll 

+ 

X 

oo 

1 


1 

L/l 

X 

1 

Xs 

II 

1 

OJ 

K) 

x(0) = 3, y( 0) = 5. 


x(0) = 3, y(0) = 2. 
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_ ^ dx dy 

dx dy 

1 - + - + 2x + y^e‘, 

x(0) = 2, y(0)=l. 

„ „ dx , dy 

9. 2 — + 4 -f- + x — y = 3e, 
dt dt 

dx dy 

- + - + 2x + 2, = e‘, 

x(0) = 1, y( 0) = 0. 

d 2 x dx dv 

io. it- 3 t + t + 2x -) ; = 0 j 

dt 2 dt dt 
dx dy 

_ + 2 X + > = 0, 


„ . dx dy 

8. 2 — — |- — — f- x + 5y — 4 1, 

dt dt 

dx dy 

- + _ + 2x + 2y = 2, 
x(0) = 3, y(0)=-4. 


x(0) = 0, y(0) = — 1, x'(0) = 0. 
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CHAPTER TEN 

Existence And Uniqueness Theory 


In certain of the preceding chapters we stated and illustrated several of the basic 
theorems of the theory of differential equations. However, in these earlier chapters our 
primary interest was in methods of solution, and our interest in theory was secondary. 
Now that we have become familiar with basic methods, we shall direct our attention to 
the more theoretical aspects of differential equations. In particular, in this chapter we 
shall state, prove, and illustrate existence and uniqueness theorems for ordinary 
differential equations. In order to fully understand the proofs of these theorems, we 
shall need to be familiar with certain concepts of real function theory. Since the reader 
may not be familiar with all of these topics, our first section will be devoted to a brief 
survey of some of them. 


10.1 SOME CONCEPTS FROM REAL FUNCTION THEORY 
A. Uniform Convergence 

We assume at once that the reader is familiar with the most basic concepts and results 
concerning functions of one real variable. To be specific, he should be familiar with 
continuous functions and their properties, and also with the basic theory of 
differentiation and integration. 

This much assumed, let us recall the definition of convergence of a sequence {c„} of 
real numbers. 

DEFINITION A 

A sequence {c„} of real numbers is said to converge to the limit c if given any e > 0, there 
exists a positive number N such that 

\c„-c | <6 

for all n > N. We indicate this by writing lim c n = c. 

n-* oo 
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► Example 10.1 


Let c n = n/(n + 1), n = 1, 2, 3, 


Then lim 

n-*ao 11 + 1 


= 1. For, given 6 > 0, 


• c = 


n 


1 


1 


n + 1 


< 6 


\n + 1 

provided n > (1/e) — 1. Thus, given e > 0, there exists N = ( 1 /e) — 1 such that 

n 


n + 1 


1 


< 6 for all n > N. 


Hence we conclude that the given sequence converges to 1. 


Let us now consider a sequence {/„} of real functions, each of which is defined on a 
real interval a < x < b. Let x be a particular real number belonging to this interval. 
Then {/„(*)} is simply a sequence of real numbers. If this sequence {/„(*)} of real 
numbers converges to a limit /(x), then we say that the sequence {/„} of functions 
converges at this particular x. If the sequence {/„} converges at every x such that 
a < x <b 9 then we say that the given sequence of functions converges pointwise on the 
interval a < x < b. More precisely, we have: 


DEFINITION B 

Let {/„} be a sequence of real functions , each function of which is defined for all x on a 
real interval a < x < b. For each particular x such that a < x < b consider the 
corresponding sequence of real numbers {/„(x)}. Suppose that the sequence {f„{x)} 
converges for every x such that a < x < b 9 and let f{x) = lim f n (x) for every x e [ a , b], 

n-> oo 

Then we say that the sequence of functions {/„} converges pointwise on the interval 
a < x < b, and the function f thus defined is called the limit function of the sequence 

{/„}• 


► Example 1 0.2 

Consider the sequence of functions {/„} defined for all x on the real interval 0 < x < 1 
by 


m = 


nx 

nx + 1 ’ 


0 < x < 1 


(« = 1.2, 3,...). 


The first three terms of this sequence are the functions f x , f 2 , and / 3 defined, 
respectively, by 


/iM = 


x 

x + r 




2x 

2x + r 


f 3 (x) = 


3x 

3x + 1 


For x = 0, the corresponding sequence {/„( 0)} of real numbers is simply 0, 0, 0,..., 
which obviously converges to the limit 0. For every x such that 0 < x < 1, the 
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corresponding sequence of real numbers is 


/„(*) = 


nx 

nx + r 


Since 


n~* oo nx + 1 

for every such x, we see that the sequence { /„ } converges to / such that f(x) = 1 for 
0 < x < 1. Thus the sequence {/„} converges pointwise on 0 < x < 1, and the limit 
function is the function / defined by 

m = o, 

f(x) =1, 0 < x < 1. 

The graphs of the functions f u / 5 , / 10 , defined, respectively, by 


/xW = 


X 

x + r 


fs(x ) = 


5x 

5x + r 


and 


/ioW = 


10x 

iox + r 


as well as that of the limit function /, appear in Figure 10.1. 

Now suppose that each term of a sequence {/„} of real functions which converges 
pointwise on a < x < b is a continuous function on a < x < b. Can we conclude from 
this that the limit function / will also be continuous on a < x < bl The answer to this 
equation is “no,” for under such circumstances / might be continuous or it might not 
be. For instance, in Example 10.2, each f n is continuous on 0 < x < 1, and the sequence 
{/„} converges pointwise on 0 < x < 1. However, the limit function / is such that 
/(0) = 0 but /(x) = 1 for 0 < x < 1; thus this limit function is certainly not continuous 



Figure 10.1 
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on 0 < x < 1. To be certain that the limit function of a sequence of continuous 
functions is itself continuous we shall need a “stronger” type of convergence than mere 
pointwise convergence. One such type of convergence is uniform convergence, which we 
shall now introduce. 

In order to introduce the concept of uniform convergence, let us turn back to the 
definition of pointwise convergence and recast it in the more basic language of 
Definition A. We have: 


DEFINITION B' 


Let {/„} be a sequence of real functions , each of which is defined on the real interval 
a < x < b. The sequence {/„} is said to converge pointwise to f on a < x < b 9 if , given 
any e > 0, for each x such that a < x < b there exists a positive number N (in general 
depending both on e and on x) such that 

I fn(x) - /Ml < e 


for every n> N. 


Now observe carefully that in general the number N of this definition depends not 
only upon e but also on x. For a given €, different numbers N(x) are in general required 
for different points x. However, if, given e > 0, there exists one single N such that 

I/.M “/Ml < € 

for every x such that a < x < b for every n> N, then we say that the convergence is 
uniform on a < x < b. More precisely, we state-: 


DEFINITION C 

Let {/„} be a sequence of real functions, each of which is defined on the real interval 
a < x <b. The sequence {/„} is said to converge uniformly to f on a < x < b if, given 
any e > 0, there exists a positive number N ( which depends only upon e) such that 

I /„(*) - f(x) I < e 

for alln> N for every x such that a < x < b. 

Geometrically, this means that given e > 0, the graphs of y = f n (x) for all n > N all 
lie between the graphs of y = f(x) + e and y = f(x) — efora < x < b (see Figure 10.2). 


► Example 1 0.3 

Consider the sequence of functions {/„} defined for all x on the real interval 0 < x < 1 
by 

nx^ 

f„(x) = — — J , 0 < x < 1 (w= 1,2,3,...). 

For x = 0, the corresponding sequence {/„(0)} of real numbers in 0, 0, 0,..., and 
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lim /„(0) = 0. For every x such that 0 < x < 1, we have 

n-> co 

flX ^ 

f n (x) = — —r and lim fjx) = x. 

YIX T" 1 n oo 

Thus the sequence {/„} converges point wise on 0 < x < 1 to the limit function / 
defined by f(x) = x, 0 < x < 1. Further, the convergence is uniform on 0 < x < 1. To 
see this, consider 


I L(X) - /(X) | 


nx 2 

nx + 1 


X 

nx + 1‘ 


Given e > 0, we have 


provided 


x 

T < 6 

nx + 1 

1 1 
n > . 

6 X 


But for x such that 0 < x < 1, we have 

i-is'-i 

e x e 

Thus if we choose N = (1/e) — 1, we have 

i i 

n > 

e x 


for all n> N. Hence, given e > 0, there exists TV = (1/e) — 1 (depending only upon e 
and not on x) such that | /„(x) — f(x) \ < e for all n > N for every x such that 0 < x < 1 . 
In other words, the convergence is uniform on 0 < x < 1. The graphs of the functions 
/i, /s, and fi 0 defined, respectively, by 


/iW = 


x + r 


fs(x) = 


5x 2 

5x + 1 ’ 


and / 10 (x) 


10x 2 
lOx + 1 ’ 


as well as that of f, appear in Figure 10.3. 
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We now state and illustrate two important theorems on uniformly convergent 
sequences, which we shall use in the existence proof in Section 10.2 We shall not prove 
these theorems here, since the proofs may be found in most texts on advanced calculus 
and real analysis. 


THEOREM A 
Hypothesis 

1. Let {/„} be a sequence of real functions which converges uniformly to f on the 
interval a < x < b. 

2. Suppose each function f n (n = 1, 2, 3, . . .) is continuous on a < x < b. 

Conclusion. The limit function f is continuous on a < x < b. 


► Example 1 0.4 

In Example 10.3 we saw that the sequence of functions {/„} defined on the real interval 
0 < x < 1 by 

nx^ 

m = (n = 1 ’ 2 ’ 3 ’" ) 

converges uniformly to a limit function / on 0 < x < 1. Further, each function 
f n (n= 1, 2, 3, . . .) is continuous on 0 < x < 1. By Theorem A we conclude at once that 
the limit function / is also continuous on 0 < x < 1. Indeed, in this example, the 
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limit function / is that defined by /(x) = x,0 < x < 1, and clearly this function 
/ is continuous on 0 < x <.l. 


THEOREM B 


Hypothesis 

1. Let {/„} be a sequence of real functions which converges uniformly to f on the 
interval a < x < b. 

2. Suppose each function f n (n = 1, 2, 3, . . .) is continuous on a < x < b. 

Conclusion. Then for every a and f such that a < a < /? < b 9 

rp rp 

lim f n (x) dx = lim f n (x) dx. 

n~>co J a J a n~* oo 


► Example 1 0.5 


We illustrate this theorem by again considering the sequence of functions {/„} 
discussed in Examples 10.3 and 10.4 and defined by 


/„(*) = 


nx 


nx + l 9 


0 < x < 1 (n = 1, 2, 3,...). 


The hypothesis of Theorem B is identical with that of Theorem A, and we have already 
noted in Example 10.4 that the sequence under consideration satisfies this hypothesis 
on 0 < x < 1. Thus we can conclude that 

lim f n (x) dx = lim f n (x) dx. 
co Jo Jo n->co 

Since lim f n (x) = f(x) = x in this example, our conclusion here is that 


lim P-=L*.P 
n-oojo^+l Jo 


We shall verify this directly. 


i: 


nx* 


nx + 1 


dx = 


1 

n n(nx 




x dx. 


dx 


x 2 x 1 1 1 1 ln(n + 1) 

= - + -j ln(nx + 1) = - - - + - 2 - — . 

2 n n 2 o 2 n n 2 


Thus 


lta r^L*, lta r> i 

n~* co Jo nx + 1 n-> co [ 2 n 


In (n + 1) 


Clearly x dx = j also, and our conclusion is thus verified. 
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Let us now consider briefly the uniform convergence of an infinite series of real 
functions, each of which is defined on a real interval a < x <b. 

DEFINITION D 


Consider the infinite series Y u n of real functions u n (n = 1, 2, 3,. . each of which is 

n = 1 

defined on a real interval a < x < b. Consider the sequence {/„} of so-called partial sums 
of this series, defined as follows: 

fi — u i> 

f 2 = u l + u 2 , 

f 3 = u l + u 2 + « 3 , 

/„ = W I + U 2 + U 3 + ■■■ + u n . 


00 

The infinite series Y is said to converge uniformly to f on a < x < bif its sequence of 

n = 1 

partial sums {/„} converges uniformly to f on a < x < b. 

The following theorem gives a very useful test for uniform convergence of series. 

THEOREM C Weierstrass M-Test 
Hypothesis 

1. Let {M n } be a sequence of positive constants such that the series of constants 

oo 

Y M n converges. 

n = 1 

oo 

2. Let Yj u n be a series of real functions such that | u n (x) | < M„ for all x such that 

n = 1 

a < x < b and for each n = 1 , 2, 3, — 

oo 

Conclusion. The series Y u n converges uniformly on a < x < b. 

n= 1 


► Example 1 0.6 


00 sin nx 

Consider the series Y — I — on i nterva l 0 < x < 1. The sequence {1 /n 2 } is a 
n = i n 

00 1 

sequence of positive constants which is such that the series Y ~ 2 1S convergent. Let us 

n=l n 

take M n = \/n 2 and observe that 


sin nx 


- n 2 ~ M " 


l«»WI = 
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for all x such that 0 < x < 1 and for each n = 1, 2, 3, Thus by Theorem C the series 

00 sin nx 

£ — 2 — converges uniformly on 0 < x < 1. 

n = 1 n 


B. Functions of Two Real Variables; the Lipschitz Condition 
DEFINITION E 

1. Aset of points A of the xy plane will be called connected if any two points of A can 
be joined by a continuous curve which lies entirely in A. 

2. A set of points A of the xy plane is called open if each point of A is the center of a 
circle whose interior lies entirely in A. 

3. An open , connected set in the xy plane is called a domain. 

4. A point P is called a boundary point of a domain D if every circle about P contains 
both points in D and points not in D. 

5. A domain plus its boundary points will be called a closed domain. 


► Example 10.7 

The set of all points (x, y) lying within the ellipse x 2 + 2y 2 = 1 and characterized by 
x 2 + 2y 2 < 1 is a domain D. The boundary points of D are the points of the ellipse it- 
self. The set of points (x, y) such that x 2 + 2y 2 < 1 within and on the ellipse is a closed 
domain. 

We assume that the reader is at least somewhat familiar with functions / of two real 
variables x and y, defined on a domain of the xy plane or on such a domain plus its 
boundary. Let us recall a few useful concepts and results. 


DEFINITION F 

Let f be a real function defined on a domain D of the xy plane , and let (x 0 , y 0 ) be an 
(interior) point of D. The function f is said to be continuous at (x 0 , y 0 ) if given any e > 0, 
there exists a 5 > 0 such that 


\f{x,y) - f(x 0 ,y 0 )\ <e 

for all (x, y)e D such that 

| x — x 0 1 < <5 and | y — y 0 1 < 


DEFINITION G 

Let f be a real function defined on D, where D is either a domain or a closed domain of the 
xy plane. The function f is said to be bounded on D if there exists a positive number M 
such that |/(x, y)| < M for all (x, y) in D. 
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THEOREM D 

Hypothesis. Let f be defined and continuous on a closed rectangle R: a < x < b 9 
c < y < d. 

Conclusion. Then the function f is bounded on R. 


► Example 10.8 

The function / defined by f{x 9 y) = x + y 2 is continuous on the closed rectangle 
R.O < x < 1, 0 < y < 2. Thus by Theorem D, the function / is bounded on R. In fact, 
we have |/(x, y)| = |x + y 2 \ < 5 for all (x, y) e R. 

Having disposed of these preliminaries, we now introduce a concept which will be of 
paramount importance in the existence and uniqueness proof of Section 10.2. 


DEFINITION H 

Let f be defined on D, where D is either a domain or a closed domain of the xy plane. The 
function f is said to satisfy a Lipschitz condition (with respect to y) in D if there exists a 
constant k > 0 such that 


\f(x,yi)~ f(x,y 2 )\ <.k\yi - y 2 1 (10.1) 

for every (x, y t ) and (x, y 2 ) which belong to D. The constant k is called the Lipschitz 
constant. 

Let us investigate the geometric significance of the Lipschitz condition. Let (x, yj 
and (x, y 2 ) be any two points in D having the same abscissa x. Consider the 
corresponding points 

P l : IX y\> fix, ^i)] and Pj-Ix, y 2 , f(x, y 2 )l 

on the surface z = /(x, y), and let a (0 < a < nil) denote the angle which the chord 
joining P i and P 2 makes with the xy plane. Then if condition (10.1) holds in D, tan a is 
bounded in absolute value and so the chord joining P l and P 2 is bounded away from 
being perpendicular to the xy plane. Further, this bound is independent of x for points 
(x, y x ) and (x, y 2 ) belonging to D . 

If / satisfies condition (10.1) in D, then for each fixed x the resulting function of y is a 
continuous function of y for (x, y) belonging to D. Note, however, that condition (10.1) 
implies nothing at all concerning the continuity of / with respect to x. For example, the 
function / defined by 

fix, y) = y + [X], 

where [x] denotes the greatest integer less than or equal to x, satisfies a Lipschitz 
condition in any bounded domain D. For each fixed x, the resulting function of y is 
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continuous. However, this function / is discontinuous with respect to x for every 
integral value of x. 


Now suppose the function / is such that -jj- exists and is bounded for all (x, y) e D. 

Then by the mean-value theorem of differential calculus, for any (x, j^) and (x, y 2 ) e D 
there exists £, where £ is between y { and y 2 , such that 


Thus 


fix, 3>i) - fix, y 2 ) = (yi - y 2 ) 


Qfjx, g) 

dy 


\f(x,yi)-f(x,y 2 )\ = \y 1 - y 2 1 


Sf(x, 0 

< lub 

dfix, y) 

dy 

dy 

(x,y)eD 


Ijt -y 2I, 


provided (x, £) g D. Thus we have obtained the following very useful result. 


THEOREM E 


M 


Hypothesis. Let f be such that — ■ exists and is bounded for all (x, y) e D , vvfcere D is a 

dy 

domain or closed domain such that the line segment joining any two points of D lies 
entirely within D. 


Conclusion. Then f satisfies a Lipschitz condition (with respect to y) in D, where the 
Lipschitz constant is given by 


k = lub 

(x,y)eD 


Vi*, y) 

dy 


► Example 1 0.9 

Consider the function / defined by f(x, y) = y 2 , where D is the rectangle defined by 

5/*/ \ A/» 

|x| < a, \y \ <b. Then - , ’ — = 2 y, and so -f- exists and is bounded for all (x, y) e D. 
oy dy 

Thus by Theorem E, the function / satisfies a Lipschitz condition in D, where the 
Lipschitz constant k is given by 2b. If we directly apply the definition of Lipschitz 
condition instead of Theorem E, we find that 

\fix,yi)~ fix,y 2 )\ = \y 2 - yi\ = \yi + y 2 \\yi- y 2 \<2b\y 1 - y 2 \ 
for all (x, y t ), (x, y 2 ) e D. 


Note that the sufficient condition of Theorem E is not necessary for / to satisfy a 
Lipschitz condition in D. That is, there exist functions / such that / satisfies a Lipschitz 
condition (with respect to y) in D but such that the hypothesis of Theorem E is not 
satisfied. 
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► Example 10.10 

Consider the function / defined by /(x, y) = x| j/|, where D is the rectangle defined by 
|x| <a,\y\<b. We note that 

\f(x,yi)~ f(x,y 2 )\ = |x|.yi| - x|y 2 || < 1*11 yi - y 2 \ ^ «lyi - y 2 \ 

for all (x, y x ), (x, y 2 ) e D. Thus / satisfies a Lipschitz condition (with respect to y) in D. 

rlf 

However, the partial derivative^- does not exist at any point (x, 0) e D for which x # 0. 

dy 


Exercises 


1. Consider the following sequences of functions {/„} which are defined for all x on 
the real interval 0 < x < 1. Show that each of these sequences converges uniformly 
on 0 < x < 1. 

(a) f„(x) = — , 0 < x < 1 (w = 1, 2, 3, . . .). 

x + n 

(b) /„(x) = x--, 0 < x < 1 («= 1,2,3,...). 

n 

2. Consider the sequence of functions {/„} defined for all x on the real interval 
0 < x < 1 by 

L(x) = — [ —, 0 < x < 1 («= 1,2,3,...). 

nx + 1 


(a) Show that the sequence {/„} converges pointwise to a function / on 
0 < x < 1. 

(b) Does f n converge uniformly to / on 0 < x < 1? 

(c) Does f n converge uniformly to / on \ < x < 1? 

3. (a) Prove the following theorem: 

Hypothesis. Let {/„} be a sequence of real functions defined on a < x < b, 
and suppose f n converges pointwise to a function / on a < x < b. Let {</>„} be 
a sequence of positive real numbers such that lim </>„ = 0. Finally, suppose 

n-+ oo 

that for each n = 1, 2, 3,..., |/(x) - /„(x)| < <fi n for all x e [ a , fc]. 

Conclusion. Then {/„} converges uniformly to / on a < x < b. 

(b) Use the theorem of part (a) to show that the sequence of functions {/„} defined 
by f n (x) = (sin nx)/n converges uniformly on 0 < x < 1. 

4. Prove Theorem A. 

5. Show that each of the following infinite series converges uniformly on every 
interval —a<x<a, where a is a positive number. 




(b) 

n= 1 n 


(C) z 


sin 2 nx 


i (2n - l)(2n + 1) 
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6. Show that each of the functions defined as follows satisfies a Lipschitz condition in 
the rectangle D defined by |x| < a, | y\ < b. 

(a) f(x,y) = x 2 + y 2 . 

(b) /(x, y) = x sin y + y cos x. 

(c) /(x,y) = xV + L 

(d) /(x, y) = A(x)y 2 + B(x)y + C(x), where A , £, and C are continuous on 
| x | < a. 

7. Show that neither of the functions defined in (a) and (b) satisfies a Lipschitz 
condition throughout any domain which includes the line y = 0. 

(a) f(x, y) = y 213 . 

(b) f(x, y) = J\y\. 

8. Consider the function / defined by f(x, y) = x| y\ for (x, y) e D , where D is the 

rectangle defined by | x | < a 9 \y \ < b. In Example 1 0. 1 0 it is stated that does not 

dy 

exist for (x, 0) e D , where x # 0. Show that this statement is valid. 


9. Consider the function / defined by 

4x 3 y 

f(x, y) = 


x 4 + y 2 y (X,y)*(0,0), 
0, (x, y) = (0, 0). 


Show that / does not satisfy a Lipschitz condition throughout any domain which 
includes (0, 0). 

IHint: Let a and b be real constants and consider \f(x, y t ) — f(x, y 2 )\ with 
(x, y L ) = (x, ax 2 ) and (x, y 2 ) = (x, fcx 2 ).] 


10.2 THE FUNDAMENTAL EXISTENCE AND UNIQUENESS THEOREM 
A. The Basic Problem and a Basic Lemma 

We now formulate the basic problem with which this chapter is primarily concerned. 


DEFINITION 

Let D be a domain in the xy plane and let (x 0 , y 0 ) be an ( interior ) point of D . Consider the 
differential equation 

A -f = f{x, y), (10.2) 

where f is a continuous real function defined on D. Consider the following problem . 

We wish to determine: 

1. an interval a < x < /? of the real x axis such that a < x 0 < /?, and 
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2. a differentiable real function </> defined on this interval [a, /?] and satisfying the 
following three requirements: 


(i) [x, 0(x)] g D, and thus /[x, </>(x)] is defined , /or a// x e [a, /?]. 
dcj>(x) 

(ii) — — = /[x, <£(x)], and thus (f> satisfies the differential equation (10.2), for all 

x e [a, /?]. 

(»0 0(*o) = 3V 


H4' s/ia// call this problem the initial-value problem associated with the differential 
equation (10.2) and the point (x 0 , y 0 ). We shall denote it briefly by 


tx =nx - y) ’ 


3'( x o) = >v 


(10.3) 


and call a function (j> satisfying the above requirements on an interval [a, /] a solution of 
the problem on the interval [a, /?]. 

If </> is a solution of the problem on [a, j8], then requirement (ii) shows that </> has a 
continuous first derivative </>' on [a, /]. 


In order to investigate this problem we shall need the following basic lemma. 


LEMMA 

Hypothesis 

1. Let f be a continuous function defined on a domain D of the xy plane. 

2. Let 4>be a continuous function defined on a real interval a < x < / and such that 
[x, <£(x)] g D for all x e [a, /]. 

3. Let x 0 be any real number such that a < x 0 < /. 

Conclusion. Then <f) is a solution of the differential equation dy/dx = f(x,y) on 
[a, /?] and is such that </>(x 0 ) = y 0 if and only if </> satisfies the integral equation 

m = yo + f m <t>m & ao.4) 

J*o 

for all x g [a, /?]. 

Proof. If <f) satisfies the differential equation dy/dx = f(x,y) on [a, /?], then 
d<f>(x)/dx = /[x, </>(x)] for all x g [a, /?] and integration yields at once 

<£(*) = I fiU <t>(t)h dt + c. 

Jxo 

If also <t>(x 0 ) = y 0 , then we have c = y 0 and so 0 satisfies the integral equation (10.4) for 
all x g [a, /?]. 
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Conversely, if (f satisfies the integral equation (10.4) for all x e [a, /?], then 
differentiation yields 

= fix, 4>(X)] 
dx 

for all x e [a, /?] and so satisfies the differential equation dy/dx = /(x, y) on [a, /?]; 
also Equation (10.4) shows that (j)(x 0 ) = y 0 . Q.E.D 


B. The Existence and Uniqueness Theorem and Its Proof 

We shall now state the main theorem of this chapter. 


THEOREM 10.1 


Hypothesis 

1. Let D be a domain of the xy plane, and let f be a real function satisfying the 
following two requirements: 

(i) f is continuous in D; 

(ii) f satisfies a Lipschitz condition (with respect to y) in D; that is, there exists a 
constant k > 0 such that 

\f(x,y l )~ f(x,y 2 )\ <k\ y x - y 2 \ (10.1) 

for all (x, yf), (x, y 2 ) e D. 

2. Let (x 0 , y 0 ) be an (interior) point of D; let a and b be such that the rectangle 
R: \x — x 0 \ < a, \ y - y 0 \ < b, lies in D; let M = max \f(x, y)| for (x, y) e R; and let 
h = min(a, b/M). 


Conclusion. There exists a unique solution </> of the initial-value problem 


dy_ 

dx 


= fix, y), 


y(*o) = y<» 


(10.3) 


on the interval |x — x 0 | < h. That is, there exists a unique differentiable real function (f> 
defined on the interval |x — x 0 | < h which is such that 


(0 


d(f(x) 

dx 


= /[x, </>(x)] for all x on this interval; 


(ii) 4>(x 0 ) = y 0 . 


Remarks. Since R lies in D, f satisfies the requirements (i) and (ii) of Hypothesis 1 in 
R. In particular, since / is thus continuous in the rectangular closed domain R, the 
constant M defined in Hypothesis 2 actually does exist. Let us examine more closely the 
number h defined in Hypothesis 2. If a < b/M, then h = a and the solution (f> is defined 
for all x in the interval \x — x 0 \ < a used in defining the rectangle R. If, however, 
b/M < a, then h = b/M < a and so the solution <f) is assured only for all x in the 



X 


b / b 

Figure 10.4 The case in which — < a so that h = min a, — 

M \ M 


M M' 


smaller interval |x — x 0 | < h < a associated with the smaller rectangle R { defined 
by |x - x 0 | < h < a, \ y — y 0 \ < b (see Figure 10.4). 

Method of Proof. We shall prove this theorem by the method of successive 
approximations. Let x be such that |x — x 0 | < h. We define a sequence of functions 

<t>U <t> 2 . 

called the successive approximations (or Picard iterants) as follows: 

4>Ax) = ,y 0 + I fit, y 0 ] dt, 


4> 2 (x) = y 0 + flu fi(t)] dt, 


<M*) = yo + fit, <t>i (03 dt. 


4>n(x) = y Q + fit, 4>„-i(t)~] dt. 

Jx 0 

We shall divide the proof into five main steps in which we shall show the following. 

1. The functions {<f> n j defined by (10.5) actually exist, have continuous derivatives, 
and satisfy the inequality | $ n {x) - y 0 1 < b on | x - x 0 1 < h; and thus /[x, 0„(x)] is 
defined on this interval. 
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2. The functions {</>„} satisfy the inequality 

M (khY 

- &-!(*) | <y — on |x-x 0 |<h. 

3. As oo, the sequence of functions {</>„} converges uniformly to a continuous 
function (f> on |x — x 0 | < h. 

4. The limit function </> satisfies the differential equation dy/dx = f(x,y) on 
|x — x 0 | < h and is such that </>(x 0 ) = y 0 . 

5. This function </> is the only differentiable function on |x — x 0 | < h which satisfies 
the differential equation dy/dx = /(x, y) and is such that 0(x o ) = y 0 . 


The Proof. Throughout the entire proof we shall consider the interval [x 0 , 
x 0 + h]; similar arguments hold for the interval [x 0 — h , x 0 ]. 


1. In this first step of the proof we use mathematical induction. Assume (f) n - l exists, 
has a continuous derivative, and is such that \ <l> n -i(x) — y 0 \ < b for all x such that 
x 0 < x < x 0 + h. Thus [x, (/> n -i(x)] lies in the rectangle R and so /[x, 0„-i(x)] is 
defined and continuous and satisfies 


l/[x, (/>„_ x (x)] | < M on [x 0 ,x 0 + fc]. 


Thus since 


= yo + 


fluh- 1(0] dt, 


we see that </>„ also exists and has a continuous derivative on [x 0 , x 0 + h]. Also, 


\4>n(x) ~ y 0 \ = 


f 


fit, (t>„-i{t)1 dt 


| flt,<t> n -,m\dt< Mdt 


I' 


= M(x — x 0 ) < Mh < b. 

Thus, also, [x, 0„(x)] lies in R and hence /[x, 0„(x)] is defined and continuous on 
[x 0 , x 0 + h]. Clearly 0! defined by 


<Pi(x) = y 0 + 


fit, yo] dt 


exists and has a continuous derivative on this interval. Also, 


l<M*)-y 0 l ^ 


I fit, y 0 ] I dt < M(x - x 0 ) < b 


and so /[x, 0! (x)] is defined and continuous on the interval under consideration. Thus, 
by mathematical induction, each function </>„ of the sequence (10.5) possesses these 
desired properties on [x 0 , x 0 + K]. 

2. In this step we again employ mathematical induction. We assume that 

Mk n ~ 2 

\<Pn-fx) - 0„-2(*)l - ( n _ !)j ( x - x o)" 1 on [Xo,X 0 + h]. (10.6) 
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I Hx) - 4> n -Ax)\ = |J {fit, - fit, dt 

< f \Rt, <t> n -Am- fit, <t>n~imdt. 

Jx 0 

Since by Step 1, \<p„(x) — y 0 | ^ hforallnon[x 0 ,x 0 + /i], the Lipschitz condition (10.1) 
applies with y j = 4>„^ 1 (x) and y 2 = ^„_ 2 (x), and we have 


I fit, - fit, <£„- 2 M]| ^ k|4>„-lM - <t>n- zMI- 


Thus we have 


| 0 W (X) — !(X)| < fc| i(0 — (t> n - 2 {t) \ dt. 


Now using the assumption (10.6), we have 


t>n(x) - <p n -i(x)\ < k 


*X \4bn-2 


,(»-!)! 


(t — x 0 )" 1 dt 


(» - 1)! 


(t-xo)"- 1 dt 


That is, we now have 


_ Mk n ~ 1 r (f-x 0 ) n ~| x _ Mfc"~ 1 
~ (n — 1)! L n _Lo nl 


Mk"~ l 

I 4>n(x) (x) | < (x - X 0 ) n , 


(x - x 0 y. 


which is precisely inequality ( 1 0.6) with (n — 1 ) replaced by n. When n = 1 , we have as in 
Step 1: 

\4>i(x) - y 0 \ < M(x-x 0 ). 

This is inequality (10.7) when n = 1. Thus by induction the inequality (10.7) is satisfied 
on [x 0 , x 0 + /i] for all n. 

Since 


in rv in 

(x - x 0 ) n < — h" = — 


M (kh) n 


we have 


I 4 >n(x) ~ 4 >n-l(x)\ < 


M (kh) n 


for n = 1, 2, 3, ... on [x 0 , x 0 + fc], 

3. Now the series of positive constants 


M * (kh) n Mkh M ( kh ) 2 M (kh) 3 

T n ^i ~nT ~ Til + T~2T + T~3T + 
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converges to 

~ Le kH - 1]. 

Also, the series 

E Wi( x ) - 0;-i(*)] 

i — 1 

is such that (10.8) is satisfied for all x on the interval x 0 < x < x 0 + A, for each n = 
1, 2, 3, Thus by the Weierstrass M- test (Theorem C), the series 

oo 

>> 0+1 I4>i(x) - 

i= 1 

converges uniformly on [x 0 , x 0 + A]. Therefore its sequence of partial sums {S n } 
converges uniformly to a limit function 0 on [x 0 , x 0 + A]. But 

S„(x) = y 0 + X [<Pi(x) ~ <t>i- iW] = <f>„(x). 

i = 1 

In other words, the sequence 0„ converges uniformly to 0 on [x 0 , x 0 + A]. Thus, since 
each 0„ is continuous on [x 0 , x 0 + A], Theorem A shows that the limit function 0 is also 
continuous on [x 0 , x 0 + A]. 

4. Since each 0„ satisfies |0„(x) — y 0 |<b on [*o> we a l so have 

1 — yo I ^ b on [x 0f x 0 + A]. Thus /[x, 0(x)] is defined on this interval and we can 
further apply the Lipschitz condition (10.1) with y x = 0(x) and y 2 = 0 n (x). Doing so, 
we obtain 


I fix, <t>(x)1 - fix, (/>„(*)] | < k\(t>{x) - (j)„(x) I (10.9) 


for xe[x 0 ,x 0 + A]. By Step 3, given e > 0, there exists N> 0 such that 
1 0(x) — 0„(x)| < e/k for all n > N and all x on [x 0 , x 0 + A]. Thus 

k\4>(x)~ 4>„(x)\<k(^j = e (10.10) 


for all n > N and all x on the interval under consideration. Thus from (10.9) and (10.10) 
we see that the sequence of functions defined for n— 1, 2, 3, ... by /[x, 0„(x)] converges 
uniformly to the function defined by /[x, 0(x)] on [x 0 , x 0 + A]. Also, each function 
defined for n = 1, 2, 3, ... by /[x, 0„(x)] is continuous on this interval. Thus Theorem B 
applies and 


4>(x) = lim </>„ +1 (x) = )>o + lim fit, <p n 
n~>o 0 n-oo Jx 0 

= ^0 + 


W] dt 

lim fit, </>„(£)] dt = y 0 + 


fit, 0(r)] dt. 


In short, the limit function 4> satisfies the integral equation 


4>(x) = y 0 + 


f fit, 4>(m dt 


xo 


on [x 0 , x 0 + A]. Thus by the basic lemma, the limit function 0 satisfies the differential 
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equation dy/dx = f(x , y) on [x 0 , x 0 + h ] and is such that 0(x o ) = y 0 - We have thus 
proved the existence of a solution of the basic initial-value problem (10.3) on the 
interval [x 0 , x 0 + h]. 

5. We now prove that the solution </> is unique . Assume that ij/ is another 
differentiable function defined on [x 0 , x 0 + h~\ such that 


#(*) 

dx 


= fix, \ A(x)] 


and i//(x 0 ) = y 0 . Then certainly 


l<AM - yo\<b 


(10.11) 


on some interval [x 0 ,x 0 + <5]. LetX! be such that |^(x) — y 0 1 < foforx 0 < x < Xj and 
|iA(Xi) — yo \ = b. Suppose Xj < x 0 + h. Then 


M 1 = 


'I'M-yo 

*1 -x 0 


b b „ 

> 7 > M. 

Xi — x 0 h 


But by the mean-value theorem there exists where x 0 < £ < x x , such that 


Mi = mt)\ = \m, * a (£)] i < m , 


a contradiction. Thus x { > x 0 + h and the inequality (10.11) holds for x 0 < x < 
x 0 + h , and so 


l*A(x) - y 0 \ < b (10.12) 

on the interval x 0 < x < x 0 + h. 

Since ^ is a solution of dy/dx = /(x, y) on [x 0 , x 0 + h] such that ij/(x 0 ) = y 0 , from 
the basic lemma we see that \// satisfies the integral equation 

'l'(x) = yo+\ fit, '/'(*)'] dt (10.13) 

J *o 

on [x 0 , x 0 + fc]. We shall now prove by mathematical induction that 

Mix) - <l>„(x)\ < - — - — (10.14) 

n! 


on [x 0 , x 0 + h~\. We thus assume that 

\ip(x) - 4>„-i(x ) | < 


k n - 1 b{x-x 0 ) n 


(n - 1)! 

on [x 0 , x 0 + h]. Then from (10.5) and (10.13) we have 


I <A(x) - <t>„(x ) | = 


f 

J x ( 


{fit, *A(r)] - /[t, </>„- iff)] } 


|/[t, *A(r)] -fit, 0„-,(f )]|<ft. 


(10.15) 


By Equation (10.12) and Step 1, respectively, the Lipschitz condition (10.1) applies with 
y x = i j/(x) and y 2 = <t> n ~ i(x). Thus we have 


- <£„(*)! 




fe|^(O-0«-i(t)l*- 
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By the assumption (10.15) we then have 

I •AM - <A«MI ss k 


C x k m ~ l b(t-x o)’ 

Jx„ (n - 1)! 
k n b ~ 

- 1)! L n _ 


dt 


k"b(x - x 0 )" 


n\ 


which is (10.15) with (n - 1) replaced by n. When n = 1, we have 


l<AM 


- <AiMI ^ [ 

JX( 


I f[t, <A(0] - f[t, y 0 ]\dt 


J * 


< k | {//(t) — y 0 \ dt < kb(x — x 0 ), 


which is (10. 14) for n = 1. Thus by induction the inequality (10. 14) holds for all n on [x 0 , 
x 0 + h]. Hence we have 


(kh) n 

l<AM-<A„MI ^ 


(10.16) 


for n = 1, 2, 3,... on [x 0 , x 0 + h]. 

oo (kh) n ( kh) n 

Now the series Y b - — — converges, and so lim b - — — = 0. Thus from (10.16), 
n = b n\ n^ao n\ 

ijj(x) = lim </>„(x) on [x 0 , x 0 + h]. But recall that </>(x) = lim 0„(x) on this interval. 

fi-»oo n-> oo 

Thus, 

•AM = <AM 


on [x 0 , x 0 + h]. Thus the solution 0 of the basic initial-value problem is unique on [x 0 , 

x 0 + h]. 

We have thus proved that the basic initial-value problem has a unique solution on 
[x 0 , x 0 + h]. As we pointed out at the start of the proof, we can carry through similar 
arguments on the interval [x 0 — h, x 0 ]. We thus conclude that the differential equation 
dy/dx = /(x, y) has a unique solution cj) such that </>(x 0 ) = y 0 on |x — x 0 | < h. 

Q.E.D. 


C. Remarks and Examples 

Notice carefully that Theorem 10.1 is both an existence theorem and uniqueness 
theorem. It tells us that 

if (i) / is continuous, and 

(ii) / satisfies a Lipschitz condition (with respect to y) in the rectangle R, 
then (a) there exists a solution 0 of dy/dx = /(x, y), defined on |x — x 0 | < K which 
is such that </>(x 0 ) = y 0 ; and 

(b) this solution cf) is the unique solution satisfying these conditions. 

Let us first concern ourselves with the existence aspect, Conclusion (a). Although the 
above proof made use of both Hypotheses (i) and (ii) in obtaining this first conclusion, 
other methods of proof can be employed to obtain conclusion (a) under Hypothesis (i) 
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alone. In other words, the continuity of / in R is a sufficient condition for the existence 
of solutions as described in conclusion (a). 

We now turn to the uniqueness aspect, conclusion (b). The continuity of / alone is not 
sufficient to assure us of a unique solution of the problem. That is, the function / must 
satisfy some additional condition besides the continuity requirement in order to obtain 
the uniqueness conclusion (b). One such additional condition which is sufficient for the 
uniqueness is of course the Lipschitz condition. Other less restrictive conditions are 
also known, however. For example, the Lipschitz condition (ii) could be replaced by the 
condition 


\f(x,yi) - fix, y 2 )\ <k\y 1 -j> 2 |lnr- —j 

1/1 — J2\ 

for all (x, y x ) and (x, y 2 ) in a sufficiently small rectangle about (x 0 , y 0 ), and uniqueness 
would still be assured. 


► Example 10.11 


Consider the initial-value problem 


dy_ 

dx 


,,i/3 


(10.17) 


y( 0 ) = 0 . 

Here /(x, y) = y 1,3 is continuous in the rectangle R:\x\ < a,\y \ < b about the origin. 
Thus there exists at least one solution of dy/dx = y 1/3 on |x| < h such that y(0) = 0. 

Existence of a solution of problem (10.17) being thus assured, let us examine the 
uniqueness aspect. If / satisfies a Lipschitz condition in R , then Theorem 10.1 will 
apply and uniqueness will also be assured. We have 


f(x,y i) - fix, y 2 ) 


yl^-yi ' 3 

yi -y 2 


yi - y 2 


If we choose y x = 5 > 0 and y 2 = —S, this becomes 


<5 x /3 — ( — <5)i/3 
S-(-S) 


1 

w 


Since this becomes unbounded as 5 approaches zero, we see that / does not satisfy a 
Lipschitz condition throughout any domain containing the line y = 0, and hence not in 
R. Thus we can not apply Theorem 10.1 to obtain a uniqueness conclusion here. On the 
other hand, we must not conclude that uniqueness is impossible simply because a 
Lipschitz condition is not satisfied. The simple truth is that at this point we can draw no 
conclusion one way or the other about uniqueness in this problem. 

In fact, the problem does not have a unique solution; for we can actually exhibit two 
solutions. Indeed, the functions </> x and </> 2 defined, respectively, by <Mx) = 0 for all x, 
and 


</> 2 (x) 


(|x) 3/2 , x > 0, 
0, x < 0, 


are both solutions of problem (10.17) on the interval — oo < x < oo. 
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We further observe that Theorem 10.1 is a “local” existence theorem or an existence 
theorem “in the small.” For it states that if / satisfies the given hypothesis in a domain 
D and if (x 0 , y 0 ) is a point of D , then there exists a solution </> of 


dy 


= fix, y). 


dx 
y(*o) = 


defined on an interval \x — x 0 \ < h, where h is sufficiently small. It does not assert that (j) 
is defined for all x, even if / satisfies the given hypotheses for all (x, y). Existence “in the 
large” cannot be asserted unless additional, very specialized restrictions are placed 
upon /. If / is linear in y, then an existence theorem in the large may be proved. 


► Example 10.12 


Consider the initial-value problem 



(10.18) 


y( 1 ) = - 1 - 

/ \ 

Here /(x, y) = y 2 and — -- - — = 2 y are both continuous for all (x, y). Thus using 

dy 

Theorem E we observe that / satisfies the hypothesis of Theorem 10.1 in every 
rectangle R , 

|x — 1 1 <a, \y + 1| < b, 
about the point (1, — 1) (see Figure 10.5). 



Figure 10.5 
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As in Theorem 10.1, let M = max |/(x, y)\ for (x, y) e R , and h = min (a, b/M). 
Then Theorem 10.1 asserts that the initial-value problem (10.18) possesses a unique 
solution defined on |x — 1| < h. 

Now in this case M = (— 1 — b) 2 = (b + l) 2 and so 

Now consider 


From 


m = 


b 

(b + l) 2 


F'(b) = 


1 - b 

{bTW’ 


we see that the maximum value of F(b ) for b > 0 occurs at b = 1; and we find F(l) = 5. 
Thus if a > 

wh?- a 

for all b > 0 and so 


regardless of the value of a. If, however, a <\, then certainly h < Thus in any case 
h < For b = 1, a > 

h = min 

This is the “best possible” h, according to the theorem. That is, at best Theorem 10.1 
assures us that the initial-value problem (10.18) possesses a unique solution on the 
interval f < x < f . Thus, although the hypotheses of Theorem 10.1 are satisfied for all 
(x, y), the theorem only assures us of a solution to our problem on the “small” interval 
\x - 1| <i 

On the other hand, this does not necessarily mean that the actual solution of problem 
(10.18) is defined only on this small interval and nowhere outside of it. It may actually 
be defined on a much larger interval which includes the “small” interval | x — 1 1 < \ on 
which it is guaranteed by the theorem. Indeed, the solution of problem (10.18) is readily 
found to be y = — (1/x), and this is actually defined and possesses a continuous 
derivative on the interval 0 < x < oo (see Figure 10.6). 


r(b + 1 ) 




min (a, i) = 


D. Continuation of Solutions 

Example 10.12 brings up another point worthy of consideration. In this problem, 
the solution y= —(1/x) was actually defined on the interval 0 < x < oo, while 
Theorem 10.1 guaranteed its existence only for | < x < f. We might thus inquire 
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Figure 10.6 


whether there is any way in which our theory could be applied to assure us that a 
solution is actually defined over a larger interval than that guaranteed by the basic 
application of Theorem 10.1. 

Suppose then that / satisfies the hypotheses of Theorem 10.1 in D and that 
(x 0 , y 0 ) e D. Let R : |x - x 0 | < a, |y — y 0 | < b be a rectangle lying in D which gives rise 
to the “best possible” h of the conclusion of Theorem 10.1. Then a single application of 
the Theorem 10.1 asserts that the initial-value problem 


dy_ 

dx 


= /(*, y), 


(10.19) 


y(* o) = y 0 , 

possesses a unique solution </> 0 on | x — x 0 1 < h, but nothing is implied about 0 O outside 
this interval. Now let us consider the extreme right-hand point for which we are thus 
assured that 0 O is defined. This is the point (x 1? y^, where x x = x 0 + h , y x = (f) 0 {x i). 
Since this point is a point of R , it is certainly a point of the domain D in which the 
hypotheses of Theorem 10.1 are satisfied. Thus we can reapply Theorem 10.1 at the 
point (x 1? )>!) to conclude that the differential equation dy/dx = /(x, y) possesses a 
unique solution such that <l>i(x l ) = y l , which is defined on some interval 
Xj < x < x l + h i9 where h { > 0. 

Now let us define 0 as follows: 

, , , f o(*)> x 0 -h<x<x 0 + h = x l , 

0(x) = < 

{<t>i(x), Xi < x < x { + h t . 

We now assert that 0 is a solution of problem (10.19) on the extended interval 
x 0 — h < x < x x + h x (see Figure 10.7). The function </> is continuous on this interval 
and is such that 0(x o ) = y 0 . For x 0 — h < x < x 0 + h we have 



X 


Figure 10.7 


and hence 


000 = y 0 + flU 0(0] dt 


on this interval. On the interval x 0 -f h < x < + h l7 we have 


0iOO = )h + /ft 0i (0] dt, 


(10.20) 


000 = yi + flu 0(0] dt. 


Since = 4>o(*i) = yo + /ft 0(0] dt, we thus have 


000 = yo + flu 0(0] dt 


(10.21) 


on the interval x 0 + h < x < x i + h x . Thus, combining the results of (10.20) and 
(10.21) we see that 0 satisfies the integral equation (10.21) on the extended interval 
x 0 — h < x < Xi + h { . Since 0 is continuous on this interval, so is /[x, 0(x)]. Thus, 


= f[_X, 0(x)] 


on [x 0 - h, Xj + h x ~\. Therefore 0 is a solution of problem (10.19) on this larger 
interval. 
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The function 0 so defined is called a continuation of the solution cj) 0 to the interval 
[x 0 — h,Xi + h^]. If we now apply Theorem 10.1 again at the point [xj + h l9 
0(xi + fci)], we may thus obtain the continuation over the still longer interval 
x 0 — h < x < x 2 + h 2 where x 2 = x t + and h 2 is positive. Repeating this process 
further, we may continue the solution over successively longer intervals x 0 — 
h < x < x n + h n extending farther and farther to the right of x 0 + h. Also, in like 
manner, it may be continued over successively longer intervals extending farther and 
farther to the left of x 0 — h. 

Thus repeating the process indefinitely on both the left and the right, we continue the 
solution to successively longer intervals \_a n , b n ~\, where 


[x 0 - h,x 0 + h] = [a 0 , ft 0 ] <= KA] <= [« 2 > t> i\ <= ••• <= K, bj <= •••. 


Let a = lim a„ and b — lim b n . 

n~* ao n~* oo 

We thus obtain a largest open interval a < x < b over which the solution 0 such that 
0(x o ) = yo may be defined. It is clear that two cases are possible: 

1. a = — oo and b = 4- oo, in which case </> is defined for all x, — oo < x < + oo. 

2. Either a is finite or b is finite or both. 

Note carefully that even if / is continuous and satisfies a Lipschitz condition in every 
bounded domain D , we still cannot conclude that (1) will necessarily be the case. The 
initial-value problem dy/dx = y 2 , y(l) = - 1 of Example 10.12 illustrates this. The 
solution of this problem (given by y = — [1/x]) is defined for 0 < x < oo and so a = 0, 
despite the fact that / such that f(x, y) = y 2 is continuous and satisfies a Lipschitz 
condition in every bounded domain. Note, however, that this / does not satisfy a 
Lipschitz condition in any unbounded domain defined bya<x<fc, — oo<y< + oo. 

On the other hand, suppose that for the initial-value problem dy/dx = f(x,y), 
y(x 0 ) = y 0 , the function / is continuous and does satisfy a Lipschitz condition in an 
unbounded domain a < x < b, — oo < y < + oo which includes (x 0 , y 0 ). Then we can 
be more definite concerning the largest open interval over which the solution of this 
initial-value problem is defined. In this connection we state without proof the following 
theorem. 


THEOREM 10.2 
Hypothesis 

1. Let f be continuous in the unbounded domain D: a < x < b, — oo < y < + oo. 

2. Let f satisfy a Lipschitz condition (with respect to y) in this unbounded domain. That 
is, assume there exists k > 0 such that 

\f(x,yi) ~ f(x,y 2 ) I < k\y l - y 2 \ 

for all (x, yj), (x, y 2 ) e D. 

Conclusion. A solution </> of dy/dx = /(x, y) such that (j>(x 0 ) = y 0 , where (x 0 , y 0 ) is 
any point of D, is defined on the entire open interval a < x < b. In particular, if a = — oo 
and b = + oo, then 0 is defined for all x, — oo < x < + oo. 



488 EXISTENCE AND UNIQUENESS THEORY 


For example, a solution of the initial-value problem 

^ = F(x)y, y(x 0 ) = y 0 , 

where F is continuous for — oo < x < + oo, is defined for all x, — oo < x < + oo. 


Exercises 

1. Consider the initial-value problem 

^- = y 4l \ y(x 0 ) = y 0 . 

(a) Discuss the existence of a solution of this problem. 

(b) Discuss the uniqueness of a solution of this problem. 

2. For each of the following initial-value problems show that there exists a unique 
solution of the problem if y 0 # 0. In each case discuss the existence and uniqueness 
of a solution if y 0 = 0. 

(a) ^r = y 2/3 ’ y(Xo) = Vo- 
te) ^j = n/T V\, jK*o) = JV 

3. For each of the following initial-value problems find the largest interval | x | < h on 
which Theorem 10.1 guarantees the existence of a unique solution. In each case 
find the unique solution and show that it actually exists over a larger interval than 
that guaranteed by the theorem. 

( a) £=l+y 2 , 3*0) = 0. 

(b) ^ = e 2 >, y(0) = 0. 

4. Show that Theorem 10.1 guarantees the existence of a unique solution of the 
initial-value problem 

Y- = x 2 + y 2 , y(0) = 0 

ax 

on the interval |x| < y/2/2. 


10.3 DEPENDENCE OF SOLUTIONS ON INITIAL CONDITIONS 
AND ON THE FUNCTION / 

A. Dependence on Initial Conditions 

We now consider how the solution of the differential equation dy/dx = /(x, y) depends 
upon a slight change in the initial conditions or upon a slight change in the function f It 
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would seem that such slight changes would cause only slight changes in the solution. 
We shall show that under suitable restrictions this is indeed the case. 

We first consider the result of a slight change in the initial condition y(x 0 ) = y 0 . Let 
/ be continuous and satisfy a Lipschitz condition with respect to y in a domain D , and 
let (x 0 , y 0 ) be a fixed point of D. Then by Theorem 10.1 the initial-value problem 




y(*o) = j>o> 

has a unique solution </> defined on some sufficiently small interval | x — x 0 1 < h 0 . Now 
suppose the initial y value is changed from y 0 to Y 0 . Our first concern is whether or not 
the new initial-value problem 


dy_ 

dx 


= fix, y), 


y(x o) = y 0 . 


( 10 . 22 ) 


also has a unique solution on some sufficiently small interval |x — x 0 | < h i . If Y 0 is 
such that | Y 0 — y 0 \ is sufficiently small, then we can be certain that the problem (10.22) 
does possess a unique solution on some such interval |x — x 0 | < h t . In fact, let the 
rectangle JR: | x — x 0 1 < a, \ y — y 0 \ < b, lie in D and let 7 0 be such that | Y 0 — y 0 1 < b/ 2. 
Then an application of Theorem 10.1 to problem (10.22) shows that this problem 
has a unique solution ijj which is defined and contained in R for |x — x 0 | < h l9 
where h t = min (a, b/2M) and M = max |/(x, y)\ for (x, y) e R. Thus we may assume 
that there exists <5 > 0 and h > 0 such that for each y o satisfying | Yq — y 0 1 < 6, problem 
(10.22) possesses a unique solution </>(x, Y 0 ) on \ x — x 0 \ < h (see Figure 10.8). 

We are now in a position to state the basic theorem concerning the dependence of 
solutions on initial conditions. 



Figure 10.8 
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THEOREM 10.3 


Hypothesis 

1. Let f be continuous and satisfy a Lipschitz condition with respect to y , with 
Lipschitz constant k , in a domain D of the xy plane; and let (x 0 , y 0 ) be a fixed point of D. 

2. Assume there exists 6 > 0 and h > 0 such that for each Y 0 satisfying | Y 0 — y 0 1 < 5 
the initial-value problem 


dy 


= f(x, y). 


dx 

y(x o) = Y 0 , 


( 10 . 22 ) 


possesses a unique solution 0(x, Y 0 ) defined and contained in D on\x — x 0 \ < h. 


Conclusion. If </> denotes the unique solution of (10.22) when Y 0 = y 09 and 0 denotes 
the unique solution of (10.22) when Y 0 = y 0 , where | JTq — 3; 0 1 = <5 1 < <5, then 

|0(x) — 0(x) | < S l e kh on |x - x 0 | < h. 

Thus the solution </>(x, 7 0 ) of problem (10.22) is a continuous function of the initial value 
Yq at Y 0 = y 0 . 


Proof. From Theorem 10.1 we know that 

0 = lim 

n-> oo 

where 

4>«(x) = (n= 1,2,3,...), 

and (f>o(x) = y 0 \ \x - x 0 | < h. 

In like manner, 

= lim 4> n , 

n~* oo 

where 

4>n(x) = y 0 +\ flu4> n -i{tj]dt (n= 1,2,3,...), 

Jjco 

and $ 0 (x) = y 0 ; |x - x 0 | < h. 

We shall show by induction that 

Ito- *■(*)! s A f (10.23) 

on [x 0 ,x 0 + h], where k is the Lipschitz constant. We thus assume that on 
[x 0 ,x 0 + h], 

1^-iW - ^-,(*)l < A E ( o) 

j=o )'■ 


(10.24) 
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Then 


\Ux) ~ hix) | = 


.Vo + 


[ 1 

J*0 

I 


(t)] dt - y 0 


j 1 

J*0 


m ^ 


^li'o->'ol+ I/O <l>n-l(t)'] -fit, (l>n-l(t)~\\dt. 
Jx 0 

Applying the Lipschitz condition, we have 

I/O, LlW] - /O, /„-l(*)]l ^ fe 1^,-xW - 0n-lMI; 
and so, since 1 7o — ^ol = 

I <£„(*) - /„MI < <5i + k I 0„-i(t) - <f)„-i(t ) | dt. 

Jx 0 

Using the assumption (10.24), we have 

\$n(x) ~ 4>n(x)\ < 


*,+4Vi 

J Xo j - 0 j • 

/i- 1 u p* 

= «i + Wi£- (f - x 0 ) J ' dt = Si 
j= o ;! Jx 0 


--i Q +1 (* - x 0 V +1 

j=o (; + !)! 


Since 


we have 


<5, 


1+ i 


0 +1 (x - x 0 ) J ’ 


+ r 


A (j + 1)! 


= I 


" 0(x - X 0 ) J ' 


J = 0 


I^W-^WI<^i I x °* 


“0 ;! 


which is (10.24) with (n — 1) replaced by n. 
Also, on [x 0 , x 0 -f h], we have 


l<M*) - <M*)I = 


h + /O *>] dt- y 0 - 


^,+r 

Jx ( 


r fit, 

Jxo 

1 


fit, y 0 ] dt 


^l/o-yol+ \fit,yol-fit,yol\dt 


k\y 0 - y 0 l dt = s i + kdi(x - x 0 ). 


Thus (10.23) holds for n = 1. Hence the induction is complete and (10.23) holds on 
[x 0 , x 0 + h]. Using similar arguments on [x 0 — h, x 0 ], we have 

,Zt\ j. , U ^ X v k J \ x - x o\ J ^ s V ( kh) j 

I <t>n(x) - <t>n(x)\ < Si X <^! i ~ 

i = 0 ]'■ j = 0 Jl 

for all x on |x — x 0 | <h,n= 1, 2, 3, Letting n -* oo, we have 

, (kh) j 


OO 
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oo 


But X 

j = o 


(kh) j 

j! 


= and so we have the desired inequality 


\4>(x) - </>(x)| < on |x — x 0 | < h. 


The remaining statement of the conclusion is now obvious. 


Q.E.D. 


Thus under the conditions stated, if the initial values of the two solutions and 
differ by a sufficiently small amount, then their values will differ by an arbitrarily small 
amount at every point of | x — x 0 1 < h. Geometrically, this means that if the 
corresponding integral curves are sufficiently close to each other initially, then they will 
be arbitrarily close to each other for all x such that |x — x 0 | < h. 


B. Dependence on the Function f 

We now consider how the solution of dy/dx = /(x, y) will change if the function / is 
slightly changed. In this connection we have the following theorem. 


THEOREM 10.4 


Hypothesis 

1. In a domain D of the xy plane , assume that 

(i) f is continuous and satisfies a Lipschitz condition with respect to y, with 
Lipschitz constant k. 

(ii) F is continuous. 

(Hi) \F(x, y) - fix, y)\ < e for (x, y) e D. 

2. Let (x 0 , y 0 ) be a point of D; and let 

(i) be the solution of the initial-value problem 




■K*o) = jv 

(ii) t// be a solution of the initial-value problem 


dy 

dx 


= F(x,y), 


y(*o) = J'o. 

(iii) [x, 0(x)] and [x, ^(x)] e D for |x — x 0 | < h. 


Conclusion. Then 

\4>(x) — ^(x)| < ^ (e kh — 1) on |x — x 0 | < h. 

Proof. Let fi 0 {x) = \j/(x) and define a sequence of functions {</>„} by 

= flt,(l>n-i(t)]dt, |x-x 0 | <h (n= 1,2,3,...). 

J*0 

Then0= lim is a solution of dy/dx = /(x, y) such that </>(x 0 ) = y 0 on |x — x 0 | < h. 

n~* oo 
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By Hypothesis l(i) the initial-value problem dy/dx =_/(x, y), y(x 0 ) = y 0 , has a unique 
solution on \ x — x 0 \ < h. Thus from Hypothesis 2(i) </S(x) = </>(x) on |x — x 0 | < h, and 
so lim (f) n = (j>. 


From Hypothesis 2(ii) we have 


<P(x) = y 0 + 


f Fit, 

Jx 0 


t//(t)]dt, lx-x 0 l<h. 


We shall show by induction that 


•, . c v ki ~ X °) J 

„(x) - i/s(x)l < e £ - 


J^i J 

on [x 0 , x 0 + h]. We thus assume that on this interval 


n-l Lj~l( y — y \J 
( o) 


j= i 


(10.25) 


(10.26) 


Then 


h(x) ~ = 


.Fo + 


/C0^-i(0] dt 




- yo - r Fit, 

Jx o 


mi ^ 


\f[tJ.-t(t)]-F[t, mil dt. 


We now write F(x, y) = /(x, y) + £(x, y). Then 


\l(x) - ip{x)\ < P i fit, i-Mi - fit, mi - sit, mil dt. 

Jx 0 

Applying the inequality \A — B\ < \A\ + |J3| and then the Lipschitz condition satis- 
fied by /, we have 


I <t>n(x) ~ >P(X)\ < 


i fit, k-i(m - fit, mil dt + m, mil dt 


f 


r x r x 

\4>«-i{t) - <A(0l dt + |<5[t, mil dt. 

Jx 0 Jx 0 


Now using the assumption (10.26) and the fact that 

l<5(x, y)| = |F(x, y) - f(x, y)| < e, 


we obtain 

Mx) - f(x)\ 


n- 1 fcj- 1 f*x £x 

< ke Yj ~ ry~ (t — *o) j dt + e dt 

j=l J' Jxo Jxo 

v 1 k J (x ~ X 0 ) i+1 f k j ~ l (x - x 0 ) j 

~‘£ (7 + 1)- +e <* -*•»-%?, j, ■ 

Thus (10.26) holds with ( n — 1) replaced by n. Also Hypothesis 1 (iii) shows that 


1, 


i<Mx) - +(x) i ^ if it, mi - Fit, mi i dt 


4 


6 dt = c(x — x 0 ) 


on [x 0 , x 0 + h]. Thus (10.25) holds for n = 1. Thus the induction is complete and so 
(10.25) holds on [x 0 , x 0 + h] for n = 1, 2, 3, 



494 EXISTENCE AND UNIQUENESS THEORY 


Using similar arguments on [x 0 — h, x 0 ], we thus have 


\<p n (x) - ij/(x)\ < e X 
j=i 


Z k J - l ix-Xnl j 


J’- 


€ n 


(kh) J 

7T 


for all x on | x — x 0 1 < h,n = 1, 2, 3, 

Letting n-* oo, we obtain 

\4>(x) - ip(x)\ <| £ 

*j=i 7 ! 

But £ — — = e kh — 1. Thus we obtain the desired inequality 
j=i J- 


1 0(x) - ^(x) | < | (£?** - 1) on | x - x 0 1 < h. Q.E.D 

Thus, under the hypotheses stated, if e is sufficiently small, the difference between the 
solutions (j) and if/ will be arbitrarily small on | x — x 0 1 < h. The following example 
illustrates how this result can be used to advantage. 


► Example 10.13 


Consider the initial-value problem 


£-*’+,’+, + 1. 

y( o) = o. 


(10.27) 


The differential equation of this problem cannot be solved explicitly by any of the 
methods which we have studied, but the differential equation dy/dx = y + 1 can be. If x 
and y are sufficiently small, the difference |(x 2 + y 2 + y + 1) — (y + 1)| = |x 2 + y 2 | 
will be less than or equal to any given e > 0. Thus the solution of problem (10.27) will 
differ from that of the problem 


dy . 
d^- y + l 


y( 0) = o 


(10.28) 


by an arbitrarily small amount if x and y are sufficiently small. We can thus use the 
explicit solution of problem (10.28) to obtain information about the solution of 
problem (10.27) in a sufficiently small neighborhood of (0, 0). 


Exercises 

1. The differential equation dy/dx = sin(xy) cannot be solved explicitly by any of our 
methods, but the differential equation dy/dx = xy can be. Explain why and how we 
can use the explicit solution of the initial-value problem 

£ = xy, y(0) = 0, 
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to obtain information about the solution of the initial-value problem 

= sin(xy), >(0) = 0, 
if x and y are sufficiently small. 


10.4 EXISTENCE AND UNIQUENESS THEOREMS FOR SYSTEMS 
AND HIGHER-ORDER EQUATIONS 

A. The General Case 

Our first concern is the basic existence and uniqueness theorem for an initial-value 
problem involving a system of n first-order differential equations of the form 

^r = /i i{x,yi,y 2 ,---,y«), 

^j 1 = f2(x,yi,yz,...,y„), (10.29) 


dy„ .. , 

= f n (x,y u y 2 ,--.,y n ), 

in the n unknowns y l9 y 2 y„ , where f l9 / 2 ,..., /„ are n continuous real functions 
defined in some domain D of real (n + l)-dimensional x, y i9 y 2 , . . . , y„ space. 


DEFINITION 

By a solution of the system (10.29) we shall mean an ordered set of n continuously 
differentiable real functions 

4>l, 

defined on some real x interval a < x < b such that 

[x, <Mx), </> 2 (x),..., 4>„(x)] 6 D 

and 


= fi IX (*)> • • • , 4>Jx)~L 

d ~^~ = / 2 [x, <M*)> <t> 2 {x),..., (A„(x)], 


dx 


= /.IX <t> i(x), 4> 2 (x),..., 4> n (x)l 


for all x such that a < x < b. 
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Corresponding to Theorem 10.1 we have following theorem dealing with the system 
(10.29). Since its proof parallels that of Theorem 10.1 we shall merely outline the major 
steps of the proof and omit the details. 


THEOREM 10.5 


Hypothesis 

1. Let the functions f x , A, . . . , /„ be continuous in the {n + 1 fdimensional rectangle R 
defined by 

\x-x 0 \<a, \y i -c l \<b i ,...,\y n -c n \<b n , 

where (x 0 , c n ) is a point of real (n -f 1 ydimensional (x, y l5 ..., y„) space and a , 

b l ,...,b n are positive constants. 

Let M be such that 


\Mx,y l ,y 2 ,.--,y„)\ ^ M 


fori= 1,2,..., n for all (x, y lt y 2 ,.. ., y n ) e R. 


Let h = min 


ft, ftj 



2. Let the functions f(i = 1,2 ,...,n) satisfy a Lipschitz condition with Lipschitz 
constant k in R. That is, assume there exists a constant k > 0 such that 


I //(*. y\,yi>---> y«) - /•(*> y\,yi,---> y»)l 

^ fed j'l - i'll + \h -h\ + ••• + \y„ - y„l) (10.30) 

for any two points (x,y l ,y 2 ,..., y„), (x, y u y 2 , . . . , y„) e R, and for i = 1,2,..., n. 


Conclusion. There exists a unique solution 

01, 02, — , fin 


of the system 


j^- = fi(x,yi,y 2 ,-,y«)> 

dy 2 

-j^ = f 2 (x,yi, y 2 ,.--, y„\ 


(10.29) 


dy„ . 

-^ = L(x,y x,y 2 ,...,y n ), 

such that 

<M*o) = Cl, <p 2 (x o) = c 2 ,...,4> n (x 0 ) = c„, 

defined for \x — x 0 \ < h. 

Outline of Proof. We first define functions 0 tJ by 

0;,oM = c« (i= 1, 2,..., n) 
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and 

4>i,j(x) = Cl + \ filt, 01,;_ x (f), - 0„. j- i(t)] dt (10.31) 

Jx o 

(i = 1, 2,..., n; j = 1, 2, 3,...). 

We then prove by mathematical induction that all of the function faj so defined are 
continuous and satisfy the relations 


I <Pi,j(x) - Cj\ <b; (i = 1, 2,..., n; j = 1, 2, 3,...) 


on |x — x 0 | < h. This much assured, the formulas (10.31) and the Lipschitz condition 
(10.30) can be used to obtain the following inequality by another induction: 

, , , , , . M(kny~ l \ 


*ol J 


j'- 


(i =1,2,. 


n;j= 1, 2, 3,...; |x — x 0 | < h). Thus also 


\4>ij(x) - <t>ij- 


Wl< 


M ( knh) J 

—ji 


(10.32) 


(i = 1,2,..., n; j = 1, 2, 3, . . .). This enables us to conclude that for each i = 1, 2 
the sequence {4>ij} defined by 

4>i,M) = (t>i, 0 (x) + X I4>i, p (x) ~ ^,,-iM] O' = 1, 2, 3, . . .) 

P= 1 

converges uniformly to a continuous function 0^. We may then show that each 
(f>i (i = 1 , 2 ,..., n) satisfies the integral equation 

<t>i(x) = C, + fill, <Mt), • • • , tf>„(0] dt, 

Jx o 

on\x - x 0 \ < h. From this we have at once that 


= fi[x, 4>i{x),..., ^(X)] 
ax 

on\x — x 0 \ < h and 0j(x o ) = c { (i = 1, 2,..., n). 

Our outline of the existence proof is thus completed. It is clear that it paralles that for 
the case n = 1 given for Theorem 10.1. The proof of the uniqueness in the present case 
also parallels that of Theorem 10.1, and we shall leave it to the reader to make the 
necessary changes and complete the present outline. 

Theorem 10.5 can be used to obtain an existence and uniqueness theorem for the 
basic initial-value problem associated with an nth-order differential equation of the 


form 

d n y J dy 

d"- l yl 

(10.33) 


’ dx” _1 J 

DEFINITION 





Consider the differential equation 
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where f is a continuous real function defined in a domain D of real (n -f 1 )-dimensional 
(x, y, y\ • • • , y (n ~ l) )-space. By a solution of Equation (10.33) we mean a real function 
having a continuous nth derivative (and hence all lower-ordered derivatives) on a real 
interval a < x < b such that 

[x, (j)(x), 4>'(x), U (x)] 6 D 
and 

4> in Hx) = /[x, <p(x), 4>'(x), 4> {n ~ l, (x)] 
for all x such that a < x < b. 


THEOREM 10.6 


Hypothesis. Consider the differential equation 


d"y 


dy_ 

dx’ 


d"- 


dx" 


(10.33) 


where the function f is continuous and satisfies a Lipschitz condition of the form 
(10.30) in a domain D of real (n + \)-dimensional (x, y, y {n ~ l) )-space. Let 
(x 0 ,c 0 ,c u ...,c n - 1 )be a point of D. 


Conclusion. There exists a unique solution (j> of the nth-order differential equation 
such that 

<t>(x 0 ) = c 0 , fi'(x 0 ) = c u ...,(t) in ~ l) (x 0 ) = c n _ 1; (10.34) 

defined on some interval \x — x 0 \ < h about x = x 0 . 


Proof 


Let y 1 = y , 





d n l y 
dx n ~ l 


Then the nth-order differential equation (10.33) is equivalent to the system of n first- 
order equations 


dy i 
dx 


= y 2> 



(10.35) 


dy n r r -i 

~^: = f[x,yny2,‘--,yn\ 

If is a solution of (10.33) which satisfies the conditions (10.34), then the ordered set 
of functions (f) l , 0 2 ,. where = </>, </> 2 = is a solution of the 
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system (10.35) which satisfies the conditions 

<M*o) = c o, <t> 2 (*o) = c u ...,<t) n {x 0 ) = <?„_!• (10.36) 

Conversely, if </> l5 <t>„ is a solution of (10.35) which satisfies conditions (10.36), 

then the function where </> = is a solution of the differential equation (10.33) 
which satisfies condition (10.34). 

Since (x 0 , c 0 , c„_ t ) is a point of D, there exists an (n + l)-dimensional 

rectangle R about this point such that the function / satisfies the stated hypotheses in 
R. Thus the system (10.35) satisfies all the hypotheses of Theorem 10.5 in R. Hence 
there exists a unique solution 0 1? , <t> n °f system (10.35) which satisfies the 

conditions (10.36) and is defined on some sufficiently small interval \x — x 0 \ < h about 
x = x 0 . Thus if we set </> = <j> l9 the above-noted equivalence of (10.33) and (10.34) with 
(10.35) and (10.36) gives the desired conclusion. Q.E.D 


B. The Linear Case 

We now consider the linear system 

^7 = fluMyi + a l2 (x)y 2 + ••• + a lB (x)y„ + F^x), 
dv 0 

— = a 21 (x)y 1 + a 22 (x)y 2 + ■■■ + a 2n (x)y n + F 2 (x), (10.37) 


JT = a m(x)yi + a n2 (x)y 2 + ••• + a nn (x)y n + F„(x), 

where the coefficients a xj and the functions F { are continuous on the interval a < x < b. 
We first prove a lemma. 

LEMMA 

Hypothesis. Let the functions a {j and fj-(l, 2, . . . , n; j = 1, 2, ...,«) be continuous on 
the interval a < x <b. 

Conclusion. Then the functions f defined by 

fi(x, yi,yi,---,y n ) = a n (x)y 1 + a i2 (x)y 2 + • • • + a in (x)y„ + F;(x) 

(i = 1, 2, . . . , n), satisfy a Lipschitz condition on 

a < x < b, — oo < y x < + oo (i = 1, 2, . . . , n). 

That is, there exists a constant k > 0 such that 

I fi(x, h, yi,---,y n ) - fi(x, yiJ 2 ,---,yn)\ 

^ Ml 3^1 - i'll + \yi - h\ + ••• + \y„ - i\,l) 

for all x such that a < x < b and any two sets of real numbers y i , y 2 , . . . , y n and y i , y 2 , • • • , 
y„(i= 1,2,3,..., ri). 
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Proof. Since each of the functions a V3 is continuous on a < x < b, corresponding to 
each of these functions there exists a constant such that |a 0 (x)| < for x e [a, b ] 
(i = 1, 2, . . . , n;j = 1, 2, , n). Let k = ma x{k y } for i = 1, 2, . . . , n;j = 1, 2, . . . , m. Then 
| a 0 -(x)| < fcforx g [a, b]. Then for every x e [ a , b ] and any two sets of real numbers 
y 2 ,...,y n and y 2 ,..., j>„, we have 

I fi(x, y u y 2 , ■ ■ ■ , y») - ft(x, y u y 2 . • • • . 5^)1 

= I«,-1 + • • • + + F f (x) 

- an(x)y 1 - a,2(x)y 2 aj*)?. - F f (x)| 

= ki(x)[yi - y t ] + a i2 (x)[y 2 - y 2 ] + ••• + a in {x)[y„ - j7„]| 

< I «. i M 1 1 yi - y 1 1 + I «.-2 (*) 1 1 h - h I + • • • + 1 «.-.(*) 1 1 y n - y n I 
^ fc(l + 1 yi — yi \ + ■" + 1 y» — yJ)- q.e.d. 

We now obtain the basic existence theorem concerning the linear system (10.37). 


THEOREM 10.7 
Hypothesis 

7. Let the coefficients a i} and the functions F t (i , j = 1, 2, . . . , n) in the linear system 
dy x 

— = a il (x)y 1 + a l2 (x)y 2 + + a ln {x)y„ + F t (x), 

dy 2 

— = a 2l (x)y i + a 22 (x)y 2 + ■ ■ • + a 2 „(x)y n + F 2 (x), 

(10.37) 

dy n 

-j- = a n i(x)yi + a„ 2 (x)y 2 + ■■■ + a nn (x)y„ + F„(x), 

be continuous on the real interval a < x < b. 

2. Let x 0 be a point of the interval a < x < b, and let c l9 c 2 , . . . , c n be a set of n real 
constants. 

Conclusion. There exists a unique solution 

4>l, 

of the system (10.37) such that 

<M*o) = C U 4>2(x 0 ) = C 2 ,..., 4 >„(x 0 ) = C„, 
and this solution is defined on the entire interval a < x < b. 

Outline of Proof. The system (10.37) is a special case of the system (10.29) with 
which Theorem 10.5 is concerned, and the present outline of proof parallels that given 
for Theorem 10.5. We first define functions fcj by 

<t>iA x ) = c i (* = 1» 2,. n) 
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and 


4>i,M) = Cj + [a,i(0^i,j-i(0 + ••• + aMtpnj-dt) + Fiitfldt (10.38) 

J*o 

(i = 1, 2, . . . , n\ j = 1, 2, 3, . . .) on a < x < b. The functions faj so defined are con- 
tinuous on the entire interval a < x < b. Also, by hypothesis there exists M > 0 such 
that + * * * + a in (x)c n + F)(x)| < M (i = 1, 2, . . . , n), a < x < b. 

By the lemma the functions defined by 


a il (x)y l + a i2 (x)y 2 + • • • + a in (x)y n + Ffx) 


satisfy a Lipschitz condition on a < x < b. We can thus use the formulas (10.38) and 
this Lipschitz condition to obtain by induction the inequality 


I <M*) - <t>i,j - iMI 


M(kn) j ~ l \x — x 0 \ j 

7 ! 


(i = 1, 2, . . . , n; j = 1, 2, 3, . . .) on the entire interval a < x < b. Thus also 


l<M*) “ ^ 


M ( knH) j 
kn j\ 


(10.39) 


(i = 1, 2,..., n;j = 1, 2, 3,...), a < x < b, where H = max(|a — x 0 |, \b — x 0 |). The 
inequality (10.39) here corresponds to the inequality (10.32) in the proof of 
Theorem 10.5. The remainder of the proof outlined for Theorem 10.5 now carries over 
to the present case for a < x <b and we obtain the desired conclusion. 

Now we are in a position to obtain the basic existence theorem for the initial-value 
problem associated with a single /ith-order linear differential equation. 


THEOREM 10.8 


Hypothesis 

1. Consider the differential equation 

d n v d n ~ l y dy 

a °^dx :: + + + a " _1 ^dx + a "^ y = F(X *’ ( 10 - 4 °) 

where a 0 ,a l9 ... 9 a n - i9 a n9 and F are continuous on the interval a < x < b and a 0 (x) # 0 
on a < x <b. 

2. Let x 0 be a point of the interval a < x < b, and let c 09 c l9 . . . , c„ _ x be a set of n real 
constants . 


Conclusion. There exists a unique solution (j) of (10.40) such that 

</>(* o) = Co. <t>\x o) = C,,..., ^ (n_1) (x 0 ) = C „_ l5 (10.41) 

and this solution is defined over the entire interval a < x < b. 

Proof. As in the proof of Theorem 10.6, we let 

dy d n ~ l y 
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Then the nth-order linear differential equation (10.40) is equivalent to the linear system 
dy t 


dx 

dyi 

dx 


y 2 > 


= }>3, 


(10.42) 


^ = _fU*) £ iW F (*) 

dx a 0 (x) yi a 0 (x) yi a 0 {x) y ” a 0 (x)' 

If 0 is a solution of (10.40) which satisfies the conditions (10.41), then the ordered set of 
functions 0 L , 0 2 ,..., 0«> where 0! = 0, 02 = 0',..-, 0„ = 0 ( ” _1) , is a solution of the 
linear system (10.42) which satisfies the conditions 

0i(*o) = c o> 0 2 (*o) = 0„(x o ) = (10.43) 

Conversly, if 0 l5 . . . , 0„ is a solution of (10.42) which satisfies (10.43), then the function 
0 = 0! is a solution of the differential equation (10.40) which satisfies conditions 
(10.41). 

The system (10.42) is simply a special case of the linear system (10.37) to which 
Theorem 10.7 applies. Thus the system (10.42) possesses a unique solution 0!,..., 0„ 
defined on the entire interval a < x < b which satisfies the conditions (10.43). Thus if 
we set 0 = 0!, the above-noted equivalence of (10.40) and (10.41) with (10.42) and 
(10.43) gives the desired conclusion. Q.E.D 

Observe that this is the basic existence theorem which was first stated without proof 
at the beginning of Chapter 4 (Theorem 4. 1 ). T wo examples illustrating it were given in 
that chapter (page 104) and it has been tacitly employed in the intervening pages 
whenever initial-value problems were solved. We given one further example. 


► Example 10.14 


Consider the initial-value problem: 


(x 2 


6 ) 


£ + (*’ + 4) £ + 


l 


dx 2x -f 3 


y = e 


y(2) = 0 , 

y'( 2) = 4. 

The coefficient of y is continuous except at x = — f . The remaining coefficients and the 
nonhomogeneous term are continuous for all values of x, — oo < x < oo. The leading 
coefficient (x 2 — x — 6) equal zero at x = — 2 and x = 3. Thus the hypothesis of 
Theorem 10.8 is satisfied in every closed interval a < x < b such that — | < a < 
x 0 = 2 < b < 3. Therefore the given initial-value problem has a unique solution, and 
we are assured that this solution is defined over every such closed interval a < x <b. 
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In Chapter 4 we stated an important corollary to this theorem concerning the 
homogeneous equation 

d n y d nl y dy 

a o(*) ^ + a t (x) + • • • + a„_ ,(x) ^ + a n (x)y = 0. (10.44) 

We now restate this corollary, which will soon be very useful to us, and give its proof. 


COROLLARY 

Hypothesis. The function 0 is a solution of the homogeneous equation (10.44) such 
that 

4>(x o) = 0, 4>'(x o) = 0,..., 0 ( " _1) (*o) = 0, (10.45) 

where x 0 is a point of an interval a < x < b on which the coefficients a 0 ,a i9 ... 9 a n are all 
continuous and a 0 (x) / 0. 

Conclusion. 0(x) = 0 for all x such that a < x < b. 

Proof. First note that 0 such that 0(x) = 0 for all x e [a, b~\ is indeed a solution of 
the differential equation (10.44) which satisfies the initial conditions (10.45). But by 
Theorem 10.8 the initial-value problem composed of Equation (10.44) and conditions 
(10.45) has a unique solution on a < x < b. Hence the stated conclusion follows. 


Exercises 


1. Consider the third-order differential equation 


d 3 y 2 dy 

+y T x + 



of the form (10.33) of the text. 

(a) Does there exist a unique solution 0 of the given equation such that 

0(0)= 1, 0'(O) = — 3, 0"(O) = 0? 

Explain precisely why or why not. 

(b) Find the system of three first-order equations of the form (10.35) to which the 
given third-order equation is equivalent. 

2. Does there exist a solution of the initial-value problem 


d 4 * 


(x 2 - 4) ^ + 2x ^ + (sin x)y = 0, 


y(0) = 0, y'(0) = 1, y"(0) = 1, /"(0) = — 1 ? 

If so, is the solution unique and over what interval are we assured that it is defined? 
Explain precisely. 
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3. Given that each of the functions and f 2 defined for all x by 


/iM = sin x and f 2 (x ) = £ 




« = 1 ( 2 n - 1 )! 


are solutions of the initial-value problem 

d 2 y 
dx 


-r L 2+y = o, 


>’(0) = 0, y'(0) = 1, 

for all x, — oo < x < oo, what theorem enables us to conclude that f x (x) = f 2 (x) for 
all x, — oo < x < oo? Explain. 


4. Consider the differential equation 

d 2 y 
fa 


a 0 (x) 37Y + ai{x) + a 2 (x)y = 0 , 


(A) 


where a 0 , a u and a 2 are continuous for all x, — oo < x < oo, and a 0 (x) # 0 for all 
values of x. 

(a) Let / be a nontrivial solution of differential equation (A), let /' denote the 
derivative of /, and let x 0 e [ a , ft]. Prove that if /(x 0 ) = 0, then /'(*o) # 0. 

(b) Let / and g be two distinct nontrivial solutions of differential equation (A), 
and suppose there exists x 0 e [a, b] such that /(x 0 ) = 0 (x o ) = 0. Prove that 
there exists a constant c such that / = eg. 

[Hint: Observe that the function h defined by h(x) = Af(x) — Bg(x ), where 
A = g'(x 0 ) and B = f\x 0 ), is also a solution of differential equation (A).] 

5. Complete the details of the proof of Theorem 10.5. 

6 . Complete the details of the proof of Theorem 10.7. 



CHAPTER ELEVEN 

The Theory of Linear Differential Equations 


We stated and illustrated certain basic theorems on higher-order linear differential 
equations in Chapter 4 and on linear systems in Chapter 7. Our main purpose in 
introducing them in these earlier chapters was to have them available as background 
material for the methods which followed. In this chapter it is the theory itself that is of 
primary concern. Here we shall restate and also prove the various theorems previously 
introduced and shall also proceed to obtain and illustrate further theorems on higher- 
order linear equations and linear systems. We shall begin by developing the basic 
theory of linear systems. 


11.1 INTRODUCTION 

We consider the normal form of linear system of n first-order differential equations in n 
unknown functions x 1 ,x 2 ,...,x w . As noted in Section 7.1A, this system is of the 
form 


dx 1 

— = a„(J)x, + a l2 (t)x 2 + ■■■ + a ln (t)x„ + F t (t), 
dx 

-yy = 021(0*1 + 022 ( 0*2 + ' • ' + a 2n (t)x„ + F 2 (t), (1 1.1) 


dx „ 

dt 


= 0„l(O*l + a„ 2 ( 0*2 + ••• + 0„„(O*n + F„(t). 


We shall assume that all of the functions defined by a^t), i = 1,2,..., n,j = 1,2,..., n, 

505 
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and Fi(t) 9 i = 1,2 ,...,n, are continuous on a real interval a < t < b. If all F^t) = 0, 
i = l,2,...,n, for all t, then the system (11.1) is called homogeneous. Otherwise, the 
system is called nonhomogeneous. 


► Example 11.1 

The system 


= — x 2 + 6x 3 , 


— — = — 10xi + 4 x 2 — 12x 3 , 
dt 


= - 2x i + x 2 - x 3 , 


is a homogeneous linear system of the type (11.1) with n = 3 and having constant 
coefficients. The system 

= lx l — x 2 + 6x 3 — 5t — 6, 


— — = — lOx! + 4 x 2 — 12x 3 — 4 1 + 23, 
dt 


— 7 — = - 2x i + x 2 — x 3 + 2, 

dt 12 3 

is a nonhomogeneous linear system of the type (1 1.1) with n = 3, the nonhomogeneous 
terms being — 5t — 6, — 4t + 23, and 2, respectively. 

We note that the system (11.1) can be written more compactly as 

dx ■ n 

-77 = Z a iM) x i + F M (i = 1,2,..., n). 
at j= 1 

We shall now proceed to express the system in an even more compact manner using 
vectors and matrices. We introduce the matrix A defined by 


A (t) = 


/«n(0 a l2 (t) 

fl 2 l (0 a 22(0 

iflniW a n2 (t) 


and the vectors F and x defined respectively by 


F(0 = 


and x = 


x. 
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Then (1) by definition of the derivative of a vector, and (2) by multiplication of a matrix 
by a vector followed by addition of vectors, we have respectively 

jdx i\ 


dx 

It 


\ dt 

and 


dt 

dx 2 

dt 

dx„ 



^u(f) a i2 (t) ••• a lw (t)^ 


/* i\ 


(fM 

A(r)x + F(t) = 

a 2l (t) a 22 (t ) a 2n (t) 


*2 

+ 

F 2 (t) 


Ki(0 a„ 2 (t) ■■■ a„„(t)j 


W J 


[fm] 


hu(t)xi + 012 ( 0*2 + ••• + oi„(0*„ + F t (t) ] 

= «2l(0*l + «22(0*2 + ••• + O 2 „(0*„ + F 2 (t) 

\a„ l (t)x l + a„ 2 (t)x 2 + ••• + a„„{t)x n + F n {t)j 

Comparing the components of dx/dt with the left members of (11.1) and the 
components of A (t)x + ¥(t) with the right members of (11.1), we see that system (11.1) 
can be expressed as the linear vector differential equation 

dx 

— = A(t)x + F(r). (11.6) 

Conversely, if A (t) is given by (1 1.4) and F (t) and x are given by (1 1.5), then we see that 
the vector differential equation (1 1.6) can be expressed as the system (1 1.1). Thus the 
system (1 1.1) and the vector differential equation (1 1.6) both express the same relations 
and so are equivalent to one another. We refer to (1 1.6) as the vector differential equation 
corresponding to the system (11.1), and we shall sometimes call the system (11.1) the 
scalar form of the vector differential equation (11.6). Henceforth throughout this 
section, we shall usually write the system (1 1.1) as the corresponding vector differential 
equation (11.6). 


► Example 1 1 .2 

The vector differential equation corresponding to the nonhomogeneous system (11.3) 
of Example 11.1 is 


— = A(r)x + F(r), 
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where 


A(0 = j 

1 7 
-10 

-1 

4 

-12 

, X = 

X X 

KJ H- 

, and F (t) = | 

1 -5t-6\ 
-At + 23 


l - 2 

1 





l ^ 1 


Thus we can write this vector differential equation as 


dx 

dt 


where x is the vector with components x u x 2 , x 3 , as given above. 


1 7 

-1 

6^ 


1 

1 

ON 

-10 

4 

-12 

X + 

-At + 23 

\ -2 

1 



2 


DEFINITION 


By a solution of the vector differential equation (11.6) we mean an n x 1 column vector 
function 


/<M 



W 


(11.7) 


whose components </>i, 0 2 » • • • > eac h h ave a continuous derivative on the real interval 
a < t < b, which is such that 


~r = Aw#*) + m 


( 11 . 8 ) 


for all t such that a < t < b. In other words , x = <|)(t) satisfies the vector differential 
equation (11.6) identically on a < t < b. That is , the components (j) x , (f) 2 , . . • , o/ tire 
such that 


= 0i(O> 

X 2 = 02(0. 

= 0n(O, 


(H.9) 


simultaneously satisfy all n equations of the scalar form (11.1) of the vector differential 
equation (1T6) for a < t < b. Hence we say that a solution of the system (11.1) is an 
ordered set of n real functions (f) l ,<^ 2 > each having continuous derivatives on 
a < t < b, such that 


x 1 = 0l(O. 

*2 = 02 ( 0 , 

X„ = 0 n (O, 

simultaneously satisfy all n equations of the system (11.1) for a < t <b. 


(11.9) 
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► Example 1 1 .3 


The vector differential equation corresponding to the homogeneous linear system 

dXi 

— = 7Xi - x 2 + 6x 3 , 


dx 2 
~dx 


— lOXi + 4 x 2 — 12x 3 , 


(11.2) 


dx 3 
dt 


= — 2x { + x 2 — x 3 , 


IS 


dx 

~dt 


The column vector function <|) defined by 


(-.0 

-1 

4 

-12 

x, where x = 

(x i\ 
*2 

l- 2 

1 

- 1 ) 


\ x ‘l 


4K0 = 



(11.10) 


is a solution of the vector differential equation (1 1.10) on every real interval a < t < b; 
for x = c|>(r) satisfies (1 1.10) identically on a < t < b, that is, 


Thus 


1 ^ \ 

I 

— 6e 3 ' 

- 

\-3 e*l 

\ 



x 2 = - 2e 3t , (11.11) 

x 3 = — e 3t , 


simultaneously satisfy all three equations of the system (1 1.2) for a < t < b, and so we 
call (1 1.1 1) a solution of the system. 

The linear system (10.37) of Theorem 10.7 and the linear system (1 1.1) of this chapter 
are identical, except for obvious changes in notation. Thus we see that Theorem 10.7 is 
in fact the basic existence and uniqueness theorem dealing with the linear system (1 1.1). 
Rewriting this theorem in the notation of vectors and matrices, we thus have the 
following basic existence and uniqueness theorem dealing with the vector differential 
equation (1 1.6). 


THEOREM 11.1 

Consider the vector differential equation 


— = A (t)x + F(t) 


( 11 . 6 ) 
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corresponding to the linear system (11.1) of n equations in n unknown functions. Let the 
components a^t), i = 1, 2, . . . , n,j = 1, 2, . . . , n, of the matrix A (t) and the components 
F t (t), i = 1,2 ,...,n,of the vector F (t) all be continuous on the real interval a < t < b. Let 
t 0 be any point of the interval a < t < b, and let 


M 

c= * 

w 

be an n x 1 column vector of any n numbers c lf c 2 . 
Then there exists a unique solution 

I* A 



\0n/ 

of the vector differential equation (11.6) such that 

<t>(r 0 ) = c; 


that is, 


0l(^o) — C l> 


02( ? o) “ C 2> 


0n(^o) 

and this solution is defined on the entire interval a < t < b. 


(11.12) 


(11.13) 


11.2 BASIC THEORY OF THE HOMOGENEOUS LINEAR SYSTEM 

We now assume that all F t (t) = 0, i = 1,2,..., n, for all t in the linear system (1 1.1) and 
consider the resulting homogeneous linear system 

dx 

~^ = a n(t)xi + a 12 (t)x 2 + ••• + a ln (t)x„, 
dx 

-jf- = «21 (0*1 + <*22(0*2 + ' ' ' + «2n(0*n, (1 1.14) 


= a^tfXi + a„ 2 (0*2 + ••• + a nn (t)x n . 

The corresponding homogeneous vector equation is the equation of the form (1 1.6) for 
which F(t) = 0 for all t and hence is 
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Throughout Sections 11.2 and 11.3 we shall always make the following assumption 
whenever we write or refer to the homogeneous vector differential equation (1 1.15): We 
shall always assume that (1 1.15) is the vector differential equation corresponding to the 
homogeneous linear system (1 1.14) of n equations in n unknown functions and that the 
components a^t), i = 1,2 ,...,n,j = 1,2,..., n, of the n x n matrix A (t) are all continu- 
ous on the real interval a < t < b. Our first result concerning Equation (11.15) is an 
immediate consequence of Theorem 11.1. 


COROLLARY TO THEOREM 11.1 


Consider the homogeneous vector differential equation 

dx A / \ 

* - a(,)x - 

Let t 0 be any point of a < t < b; and let 

AM 



W 

be a solution of (11.15) such that (J>(r 0 ) = 0, that is, such that 

0l(^o) = 02^o) = * " = </>„(£()) = 0. 
Then c|)(t) = 0 for all t on a < t < b; that is, 

t) = ••• = = 0 

for all t on a < t < b. 


(11.15) 


(11.16) 


Proof. Obviously (|) defined by ty(t) = 0 for all t on a < t < b is a solution of the 
vector differential equation (11.15) that satisfies conditions (11.16). These conditions 
are of the form (11.13) where c x = c 2 = **• = c n = 0; and by Theorem 11.1 there is a 
unique solution of the differential equation satisfying such a set of conditions. Thus 
<|) such that <|)(t) = 0 for all t on a < t < b is the only solution of (11.15) such that 
Wo) = 0 . QE.D. 


THEOREM 11.2 

A linear combination of m solutions of the homogeneous vector differential equation 

dx 

— = \(t)x (11.15) 

is also a solution of (11.15). Thatis,if the vector functions cj) 1 ,cj) 2 ,...,<|) m are m solutions 
of (11.15) and c l ,c 2 ,..-,c m are m numbers, then the vector function 

m 

<l>= X c*<K 

k= 1 


is also a solution of (11.15). 
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Proof. We have 


d r m 

~n Z C *<M0 

at |_fc = i 


m r d ~| m 

Z J t c k< K(0 = Z c k 
k = 1 J k=l 


<%(0 

dt 


Now since each <|) fc is a solution of (11.15), 


d+k(t) 

dt 


= A(0<|> t (f) 


for k = 1, 2,..., m. 


Thus we have 


d 

It 


Z c Mt) 


k= 1 


= Z c*A(f)<M0- 


We now use Results A and B of Section 7.5A. First applying Result B to each term in the 
right member above, and then applying Result A (m — 1) times, we obtain 


X c*A(t)<M0 = Z A(r)[c*<Mr)] = Aft) £ c k 4>M 

k = 1 k= 1 k=l 


Thus we have 


that is, 


d 

dt 


X c k$k(t) 

k= 1 


#(0 


dt 


m 

= Aft) £ c k$k(t) 

_k= 1 


= A(t)4>(t), 


for all t on a < t < b. Thus the linear combination 

m 

<l> = Z C k$k 


is a solution of (11.15). Q.E.D. 

Before proceeding, the student should return to Section 7.5C and review the concepts 
of linear dependence and linear independence of vector functions. 

In each of the next four theorems we shall be concerned with n vector functions, and 
we shall use the following common notation for the n vector functions of each of these 
theorems. We let , (t> 2 , . . . , c|)„ be the n vector functions defined respectively by 



l<Pn(t)\ 


/ $12W\ 


/*i«(0\ 

<M0 = 

[ 021 (0 

, = 

' *22(0 


*2.(0 


\<M0/ 






Carefully observe the notation scheme. For each vector, the first subscript of a 
component indicates the row of that component in the vector, whereas the second 
subscript indicates the vector of which the component is an element. For instance, 0 35 
would be the component occupying the third row of the vector <|) 5 . 
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DEFINITION 

The n x n determinant 

011 012 *** 01 w 

4>21 4> 22 •" <t>2n ( 1U8) 

0*1 0*2 *“ 0 ** 

is called the Wronskian of the n vector functions <|>i, <t> 2 , - - , <|> n defined by (11.17). We 
will denote it by W{( J> 1? <t> 2 , — , <|>„) and its value at t by W(i h, <J) 2 , • • • , <t>„)(0* 


THEOREM 11.3 

If the n vector functions c)^, cj> 2 , . . . , <(>„ defined by (11.17) are linearly dependent on 
a < t < b, then their Wronskian W( <(>!, f\> 2 ,..., 4>„)(t) equals zero for all t on a < t < b. 

Proof. We begin by employing the definition of linear dependence of vector 
functions on an interval: Since (|>i,(|>2,- ••,<(>« are linearly dependent on the interval 
a < t < b, there exist n numbers c 1 ,c 2 ,...,c„, not all zero, such that 

Cl<M0 + C 2<M0 + *“ + C *<M0 = 0 

for all t g [a, b~\. Now using the definition (11.17) of <(>! , c(> 2 , . . . , c|>„, and writing the 
preceding vector relation in the form of the n equivalent relations involving cor- 
responding components, we have 

Ci0n(O + C 2 0 12 (O + ••• + C B 0 ln (O = 0, 

Ci02lW + C 2 (f) 22 (t) + “• + C„0 2 „(O = 0, 


Cl 0*1 (t) + c 2 0„ 2 (O + ••• + C n <j) nn (t) = 0, 

for all t e [a, b]. Thus, in particular, these must hold at an arbitrary point t 0 e [a, b~\. 
Thus, letting t = t 0 in the preceding n relations, we obtain the homogeneous linear 
algebraic system 

01l(*o) C l + 012^o) C 2 + *“ + 01*(^o) C * = 

02l(*o) C l + 022^o) C 2 + *** + 02*(*o) C * = 


4>nl(to)Cl + 0* 2 (*O)C 2 + ••• + 4>nn(to)C n = 0, 

in the n unknowns c i9 c 2 , . . . , c n . Since c l9 c 2 ,...,c n are not all all zero, the determinant 
of coefficients of the preceding system must be zero, by Theorem A of Section 7.5C. 
That is, we must have 

01l(*o) 01 2(^0) *** 01*(*o) 

02 1 (^0) 022^o) *** 02*(^o) _ n 


0* l(*o) 0*2^o) *** 0*n(^o) 
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But the left member of this is the Wronskian W(fy u <j> 2 , . . . , <t>,,)(t 0 ). Thus we have 

^(ct> 1 ,ct) 2 ,...,ct)J(t 0 ) = 0. 

Since t 0 is an arbitrary point of [a, f>], we must have 

for all t on a < t < b. Q.E.D. 


Examine the proof of Theorem 1 1.3 and observe that it makes absolutely no use of 
the properties of solutions of differential equations. Thus it holds for arbitrary vector 
functions, whether they are solutions of a vector differential equation of the form 
(11.15) or not. 


► Example 1 1 .4 


In Example 7.31 of Section 7.5C we saw that the three vector functions <|> 2 , and <j> 3 
defined respectively by 


<M0 = 

(e 2 ') 

2e 2 ' 

, 4>z(«) = 

e 2 ' \ 

4e 2 ' 

, and <t> 3 (f) = 

1 e 2t \ 
e 2t 


v e2 1 


^1 le 2, j 


v 2 / 


are linearly dependent on any interval a <t < b. Therefore, by Theorem 11.3 their 
Wronskian must equal zero for all t on a < t < b. Indeed, we find 



e 2t 

e 2t 

e 2t 

WW,, <>2, 4> 3 )(0 = 

2e 2t 

4e 2t 

e 2t 


5e 2 ' 

lle 2t 

2e 2t 


for all t. 


THEOREM 11.4 

Let the vector functions c|>, c|> 2 , . . . , <)>„ defined by(11.17)ben solutions of the homogeneous 
linear vector differential equation 

■jj~ = A(t)x. (11.15) 

If the Wronskian W{ty { , <|> 2 , — , <k)(*o) = 0 at some t 0 e [a, b], then <(>! , <|> 2 are 
linearly dependent on a < t < b. 

Proof. Consider the linear algebraic system 

C l 01l(*o) + ^2 01 2 (^ 0 ) + ’ * ' + Cnfilnito) = 

C 102l(^o) + C 2 + “* + C n02n(^o) = 0, (11.19) 


C 10fil(^o) + C 20 ii2(*o) + *“ + 

in the n unknowns c l5 c 2 , . . . , c n . Since the determinant of coefficients is 
^(<l>i,<l> 2 ,---,<k)(*o) and W(ty l9 (|) 2 ,..., <t> n )(t 0 ) = 0 



11.2 BASIC THEORY OF THE HOMOGENEOUS LINEAR SYSTEM 515 


by hypothesis, this system has a nontrivial solution by Theorem A of Section 7.5C. 
That is, there exist numbers c l9 c 2 ,..., c„, not all zero, that satisfy all n equations of 
system (11.19). These n equations are the n corresponding component relations 
equivalent to the one vector relation 

Ci<t>i(fo) + c 2 <Mto) + • + c„<t>„(t 0 ) = 0- (11.20) 

Thus there exist numbers c l5 c 2 ,..., c„, not all zero, such that (11.20) holds. 

Now consider the vector function c|> defined by 

= ci<Mf) + c 2 <Mi) + • + (H-21) 

for all t g [a, b~\. Since <|> 1 , <|> 2 ,..., <(>„ are solutions of the differential equation (11.15) 
by Theorem 11.2, the linear combination c|> defined by (11.21) is also a solution of 
(11.15). Now from (11.20), we see that this solution c|> is such that c|>(£ 0 ) = 0. Thus 
by the corollary to Theorem 11.1, we must have c|>(£) = 0 for all t e [a, b], That is, 
using the definition (1 1.21). 

Ci<M0 + c 2 <M0 + •• + c„<J>„(t) = 0 

for all t g [a, b] 9 where c x , c 2 , . . . , c n are not all zero. Thus, by definition, 4>i , § 2 , • • • , <|>„ 
are linearly dependent on a < t < b. Q.E.D. 


► Example 1 1 .5 


Consider the vector functions and ct> 3 defined respectively by 



( e3 ‘ \ 

( ^ ) 


f-3e 3, \ 

<M0 = j 

— 2e 3 ' , <|> 2 (t) = 

-4e ! ' 

, and <( > 3 (t) = 

6e 3t 


\-e 3 < 

i-H 


\ l* 3 ' / 


It is easy to verify that <|>i , <J> 2 , and <j> 3 are all solutions of the homogeneous linear vector 
differential equation 


7 

-10 

-1 

4 

-12 

X, 

where x = 

i x A 

x 2 

I" 2 

1 

~ l ) 



[ X3 I 


( 11 . 10 ) 


on every real interval a <t < b (see Example 1 1.3). Thus, in particular, <|>i, <|> 2 , and <t> 3 
are solutions of (1 1.10) on every interval [a, b~\ containing t 0 — 0. It is easy to see that 


^(<t>i,<t>2,<t> 3 )(0) 


1 2 
-2 -4 

-1 -2 


-3 

6 

3 


= 0 . 


Thus by Theorem 11.4, <|> 1 , cj) 2 , and cj) 3 are linearly dependent on every [a, b~\ 
containing 0. Indeed, note that 

<J>i (0 + <M0 + <M0 = 0 

for all t on every interval [a, b], and recall the definition of linear dependence. 


Note. Theorem 11.4 is not true for vector functions <(>„ that are not 

solutions of a homogeneous linear vector differential equation (11.15). For example, 
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consider the vector functions (J^ and c|> 2 defined respectively by 


<M0 



and <t> 2 (0 



It can be shown that and cj) 2 are not solutions of any differential equation of the form 
(1 1.15) for which n = 2 [see I. G. Petrovski, Ordinary Differential Equations (Prentice- 
Hall, Englewood Cliffs, N.J., 1966), Theorem, pages 1 10-1 1 1, for the method of doing 
so]. Clearly 




^0 

0 


0 


= 0 


for all t 0 in every interval a < t < b. However, and c|> 2 are not linearly dependent. To 
show this, proceed as in Example 7.32; see Exercise 6 at the end of Section 7.5C. 


THEOREM 11.5 


Let the vector functions c|) 1 ,c|) 2 ,...,c() n defined by {11.17) be n solutions of the homo- 
geneous linear vector differential equation 


dx A / \ 

T, ~ A(,)x 


(11.15) 


on the real interval [a, b]. Then 


Proof. 


either W(^ u ^ 2 ,... 9 ^ n )(t) = 0 
or W(^ 2 ,...^ n )(t) = 0 

Either »r(* 1 ,+ 2 ,...,+J(0 = 0 
or » r (+ 1 ,* 2-..,+.)(0 = 0 


for all t e [a, b] 9 
for no t e [a, b~\. 

for some t e [a, b] 
for no t e [a, b]. 


If fF(<h,<|> 2 ,..., <►«)(*) = 0 for some t e [a, b] 9 then by Theorem 11.4, the solutions 
<|)i,<() 2 ,. are linearly dependent on [a, b]; and then by Theorem 11.3, 
- .,+„)W = o for all t e [a, b], Thus the Wronksian of <|>i , <(>2 , • ••,<!>„ either 
equals zero for all t e [a, b~\ or equals zero for no t e [a, b]. Q.E.D. 


THEOREM 11.6 

Let the vector functions <|>i, <|> 2 * - • defined by [1 1.17) be n solutions of the homo- 
geneous linear vector differential equation 

dx 

— = A(J)x (11.15) 

on the real interval [a, b]. These n solutions <(>!, <J> 2 ,..., ()>„ of (11.15) are linearly in- 
dependent on [ a , b] if and only if 

W(+ l9 + 29 ... 9 + H )(t)* 0 


for all t e [a, b]. 
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Proof. By Theorems 11.3 and 11.4, the solutions <t>i,<t> 2 ,...,<|>„ are linearly 
dependent on [a, b~\ if and only if \V(ty { ,ty 2 ,...,ty n )(t) = 0 for all t e [a, b~\. Hence, 
.,<()„ are linearly independent on [ a , b~\ if and only if W(§ 1 ,§ 2 ,... , <|> n )(t 0 ) # 0 
for some t 0 e \_a,b~]. Then by Theorem 11.5, W(^ l ,^> 2 ,...,^ n )(t 0 ) ^ 0 for some t 0 e 
[_a,b~] if and only if W(^ i ,^ 2 ,...,^ > n )(t) / 0 for all t e [a,b~\. Q.E.D. 

► Example 11.6 

Consider the vector functions <|>i,<|> 2 , and c|> 3 defined respectively by 


<M0 = -e 2, \, <M0 = -2e 3 ' , and <J> 3 (t) = -6e 5 ' 

\ — e 2 ‘ / \ — e 3t \ — 2e 5 ' , 


( 11 . 22 ) 


\ / \ / ' / 

It is easy to verify that <|>i , <t> 2 9 and <l >3 are all solutions of the homogeneous linear vector 
differential equation 


( 11 . 10 ) 


on every real interval a < t < b. We evaluate 

e 2t e 


(-.0 

-1 

4 

-12 

X, 

where x = 

(xi\ 

x 2 


1 




1x3/ 


e 2t 

e 3t 

3e 5 ' 

-e 2 ' 

— 2e 3t 

— 6e 5t 

-e 2t 

-e 3t 

— 2e 5t 


for all real t. Thus by Theorem 1 1.6, the solutions <t> x , <J> 2 , and cj> 3 of (1 1.10) defined by 
(11.22) are linearly independent on every real interval [a, &]. 


DEFINITIONS 

Consider the homogeneous linear vector differential equation 

dx 

— = A(t)x, (11.15) 

where xis ann x 1 column vector . 

1. A set of n linearly independent solutions of (11.15) is called a fundamental set of 
solutions of (11.15). 

2. A matrix whose individual columns consist of a fundamental set of solutions of 
(11.15) is called a fundamental matrix of (11.15). That is, if the vector functions 

<j> 2 , . . . , (j>„ defined by (1L17) make up a fundamental set of solutions of (11.15), then the 
n x n square matrix 

/<M0 <t>12(t) ••• <Pln(t)\ 

02l(O ^22(0 ■" <^2n(l) 

\<M0 4>n 2 (t) •• 4>nn(t) / 

is a fundamental matrix of (11.15). 
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► Example 11.7 


In Example 11.6 we saw that the three vector functions <J> X , ((> 2 , and <J> 3 defined 
respectively by 



(e» \ 


( e* \ 

<M0 = | 

<S <N 

1 1 

> ^ 2(0 — 

-2e 3 ' 

\ -e* ) 


and <M t) = 



(11.22) 


are linearly independent solutions of the differential equation 


dx 

dt 



( 11 . 10 ) 


on every real interval [a, b]. Thus these three solutions cf)^ <J> 2 , and <t> 3 form a 
fundamental set of differential equation (11.10), and a fundamental matrix of the 
differential equation is 


e 2 ’ 

e 3 ' 

3e 5t 

— e 2 ' 

— 2e 3 ‘ 

1 

ON 

LTt 

-e 2 ' 

— e 3t 

— 2e 5t 


We know that the differential equation (11.10) of Examples 11.6 and 11.7 has the 
fundamental set of solutions <|>i, <l >3 defined by (11.22). We now show that every 

vector differential equation (11.15) has fundamental sets of solutions. 


THEOREM 11.7 

There exist fundamental sets of solutions of the homogeneous linear vector differential 
equation 

^ = A(t)x. (11.15) 

Proof. We begin by defining a special set of constant vectors u 1 ,u 2 ,...,u n . We 
define 


l l \ 


/ 0 \ 


l°\ 

0} 


l' 


0 

0 

«2 = 

0 

, ,u„ = 

0 

0 j 


l°J 


1 ° 

\°l 


w 


w 


That is, in general, for each i = 1,2 ,...,«, u ( has ith component one and all other 
components zero. Now let (|>i , <t> 2 » ? be the n solutions of (11.15) that satisfy the 
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conditions 


that is 


<t><(t 0 ) = (f = l,2,...,n), 


<M*o) = ■l.+ 2 (*o) = «2»---,<t>n(fo) = 


where t 0 is an arbitrary (but fixed) point of [ a , b]. Note that these solutions exist and are 
unique by Theorem 1 1.1. We now find 


^(<t>l , <l>2 > • • • > 4 >n)(to) = ^(U, , U 2 , . . . , U„) = 


10-0 
0 1 ••• 0 


1 # 0 . 


0 0 ••• 1 


Then by Theorem 11.5, W^(«J) 1 ,4> 2 > »<!>«)(() ^ 0 for all te[a, 6]; and so by 

Theorem 11.6 solutions <|)i,<|) 2 ,...,<|> n are linearly independent on [a, b]. Thus, by 

definition, <t> la <|» 2 , form a fundamental set of differential equation (11.15). 

Q.E.D 


THEOREM 11.8 


Let <J>!, <|> 2 , — , <l>n defined by (11.17) be a fundamental set of solutions of the homo- 
geneous linear vector differential equation 

dx 

— = A(t)x (11.15) 

and let <|) be an arbitrary solution of (11.15) on the real interval [a, b], Then <|> can be 
represented as a suitable linear combination of <(>! , <t> 2 , • ♦ . , 4>r»; that is, there exist numbers 
c l9 c 2 ,..., c n such that 

<j) = <?!<)>! + c 2 ( )>2 + *“ + c n ty n 

on [a, b]. 


Proof. Suppose <| >(t 0 ) = u 0 , where t 0 e [a, b] and 

l u io\ 



\UnO] 

is a constant vector. Consider the linear algebraic system 


Cl<t>n(t 0 ) + ^2 01 2 (^ 0 ) + *** + C n ( t ) ln(to ) ~ W 10> 

C 102l(^o) + C 2022^o) + *** + C n ( l ) 2n(to) = U 20> 

(11.23) 


C \ 0 nl(*o) T C 2 ( Pn2{t () ) “b * * * “b ^n 0 nn(^o) 
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of n equations in the n unknowns c x , c 2 , . . . , c n . Since , <J> 2 is a fundamental set 

of solutions of (11.15) on [a, ft], we know that cj)^ cj) 2 ,. are linearly independent 
solutions on [a, ft] and hence, by Theorem 11.6, W($ u <|> 2 ,..., $ n )(to) ^ 0- Now 
observe that <|> 2 ,..., <t> n )^o) is the determinant of coefficients of system (11.23) 
and so this determinant of coefficients is unequal to zero. Thus by Theorem B of 
Section 7.5C, the system (11.23) has a unique solution for c l9 c 2 ,..., c n . That is, there 
exists a unique set of numbers c x , c 2 , . . . , c n such that 

Ci<Mt 0 ) + c 2 <M£o) + ' •' + cA(£ 0 ) = u o> 

and hence such that 


— u o — X (11.24) 

fc= i 

Now consider the vector function \| / defined by 

'HO = t C A(0- 

fc = l 

By Theorem 11.2, the vector function \| / is also a solution of the vector differential 
equation (11.15). Now note that 

n 

'K£o)= Z c A h(to)- 

k= 1 

Hence by (11.24), we obtain vl/(t 0 ) = <t>(^o)- Thus by Theorem 11.1 we must have 
\| i(t) = <| >(t) for all t g [ a , ft]. That is 

<t>(0 = t C A( t) 

fc = i 

for all t g [a, ft]. Thus <|> is expressed as the linear combination 

<l> = Ci<t>i + c 2 <|> 2 + + cA 

of <|>i, <|> 2 ,..., <I> M where c u c 2 ,..., c„ is the unique solution of system (11.23). Q.E.D. 


As a result of Theorem 1 1.8, we are led to make the following definition. 


DEFINITION 

Consider the homogeneous linear vector differential equation 

dx 

Tt =A(t)x, (11.15) 

wftere xisann x 1 column vector. By a general solution of (11.15), we mean a solution of 
the form 

Ci<t>i + C 2 < t > 2 + ••• + 


wftere c l9 c 2 ,..., c„ are n arbitrary numbers and ((h, <|> 2 ,..., <!>„ is a fundamental set of 
solutions of (11.15). 
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► Example 1 1 .8 

Consider the differential equation 


7 

-10 

-1 

4 

-12 

X, 

where x = 

(x^ 

x 2 

- 2 

1 

-1/ 



\ X3 


( 11 . 10 ) 


In Example 11.7 we saw that the three vector functions <t> 2 , and <|> 3 defined 
respectively by 


<M0 = 


-2c 3 ' , and <| > 3 (r) = -6e 5t (11.22) 

-e 3t \ — 2e 5t l 


form a fundamental set of differential equation (11.10). Thus by Theorem 1 1.8 if <|> is an 
arbitrary solution of (1 1.10), then c|> can be represented as a suitable linear combination 
of these three linearly independent solutions c|>i , <J> 2 , and c|> 3 of (1 1.10). Further, if c { , c 2 , 
and c 3 are arbitrary numbers, we see from the definition that c Y + c 2 < |> 2 + c 3 < |> 3 is a 
general solution of (11.10). That is, a general solution of (11.10) is defined by 

I e 2t \ / e 3t \ / 3c 5 ' \ 

Ci -e 2 ' +c 2 -2e 3t + c 3 — 6e 5t \ 

\-e 2 ') \-e 3 ') \- 2 e 51 ) 

and can be written as 

x i = c t e 2 ‘ + c 2 e 3 ' + 3 c 3 e 5 ', 
x 2 = —Cie 2 ' — 2 c 2 e 3 ' — 6 c 3 e St , 
x 3 = -c y e 2t - c 2 e 3 ' — 2 c 3 e 5 ', 
where c l ,c 2 , and c 3 are arbitrary numbers. 


Exercises 

In each of Exercises 1-6, determine whether or not the matrix in Column I is a 
fundamental matrix of the corresponding linear system in Column II. 


e 2 ' 

e~ 3 ‘ ' 

2c 2 ' 

7e" 3 ' 

c 2 ' 

lie -3 '. 


e 3t e 


-e 3 ‘ -e 


-e- 2t 

4e~ 2t 


e 4 ' 0 2e 4 '\ 

2e 4 ' 3e‘ 4e 4 ' , 
e 4 ‘ e* 2c 4 '/ 


1 1 - 1 \ 

2 3 -4 x. 

4 1 -4 


1 -3 9\ 

0 -5 18 x. 

0 -3 10 
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11.3 


5. 


6 . 



dx 

~di 

dx 

dt 


dx 

~di 


= x 



FURTHER THEORY OF THE HOMOGENEOUS LINEAR SYSTEM 
THEOREM 11.9 

Let 



(4> n 

012 * 

•• 4>u^ 

<D = 

<^21 

022 

4>2n 


fnl 

0n2 

" <t>nnj 


be a fundamental matrix of the homogeneous linear vector differential equation. 

dx 


dt 


= A (t)x, 


(11.15) 


and let <t> be an arbitrary solution of (11.15) on the real interval [a, b]. Then there exists a 
suitable constant vector 

M 

c = - C2 


such that 


on [a, b\ 


\ c -/ 


<|) = <Dc 


(11.25) 


Proof. Since O is a fundamental matrix of differential equation (11.15), its 
individual columns 


= 


l<t>ik\ 

02 k 
0 'nh] 


(k = 1,2,..., n), constitute a fundamental set of solutions of (11.15). Thus by 
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Theorem 11.8 there exist numbers c l9 c 2 ,...,c n such that 

<t> = + ^2<t>2 + ••• + C n$n 

on [a, b\ In component form, this is 

01 = C 1011 + C 2012 + + C *01* 
02 = C \ 02 1 + C 2022 + * * * + C *02* 


0* = Cl 0*1 + C 2 <t>n2 + + C n <j> nn 

Now observe that 


I c l<t>ll + ^2012 + • 

" + C n<£lr^ 


/*11 

012 ‘ 

" <f>ln\ 


I C 1 1 

c 1 02 1 + C 2 022 + ' 

• + Cn<t> 2n 

= 

21 

022 

4>2n 


C 2 

{ C 1 0*1 + C 20*2 + ’ ’ 

' + c n <t> nn j 


\<t>nl 

0*2 

" <Pnnj 


W 


In vector form this is precisely 

cj) = Oc, 

where 


M 



(11.25) 


Q.E.D. 


Note . In terms of the concept of general solution, we see that the result of 
Theorem 11.9 can be expressed as follows: The general solution of the homogeneous 
linear vector differential equation (11.15) can be written in the form Oc, where O is a 
fundamental matrix of (1 1.15) and c is an arbitrary n-rowed constant vector. 


► Example 1 1 .9 


As noted in Examples 1 1.7 and 1 1.8 we have seen that the three vector functions defined 
by (11.22) form a fundamental set of differential equation (11.10). It follows im- 
mediately that 


0(0 = 


I e 2 ‘ 

e 31 

3e 5 ‘ 

— e 2t 

— 2e 31 

1 

ON 

\~e 2 ' 

-e 3t 

— 2e 5 ‘ 


is a fundamental matrix of differential equation (11.10). Thus an arbitrary solution c|>(0 
of (1 1.10) can be written in the form 


where c = I c 2 


♦w = 

is a suitable constant vector. 
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THEOREM 11.10 


The unique solution of the homogeneous linear vector differential equation 


< 

II 

(11.15) 

that satisfies the initial condition 


cT 

II 

X 

o 

(11.26) 

where t 0 e [a, b] 9 can be expressed in the form 


4(0 = O(00 -1 (fo)x 0 , 

(11.27) 


where <b(t) is an arbitrary fundamental matrix of differential equation (11.15). 

Proof. By Theorem 1 1.9 there exists a constant vector c such that <| >(t) = <S>(t)c for 
all t e [ a , b~\. Thus, in particular, c|>(t 0 ) = <D(t 0 ) c; and applying the initial condition 
(1 1.26), we have x 0 = Q>(t 0 )c. The determinant |0(r)| is the Wronskian of the n linearly 
independent solutions of (11.15) that constitute the individual columns of the 
fundamental matrix <£>(t). Thus by Theorem 11.6 we have |O(t 0 )| / 0; and so O(r 0 ) is 
nonsingular and its inverse O - l (t 0 ) exists. Thus we obtain 

® _1 (to)^o = ® -1 (t 0 )®(fo)c = Ic = c. 

Now, using this value of c, we find 

4>(t) = <S>m-\t 0 )x 0 . Q.E.D. 


► Example 11.10 

Suppose we seek the unique solution <|> of the differential equation 


(-.0 

-1 

4 

-12 

x, where x = 

h\ 

x 2 

1-* 

1 

-* 


Us/ 


that satisfies the initial condition 


(11.10) 


c|>(0) = x 0 , where x 0 = 4^. (11.28) 

By Theorem 1 1.10, this solution is given by 

<t>(t) = <I>(t)<D“ 1 (0)xo, (11.29) 

where <I>(t) is a fundamental matrix of differential equation (11.10). In Example 1 1.9 we 
observed that 


e 2t 

e 3 ' 

3e 5t 

-e 2t 

— 2e 3t 

1 

Os 

Lh 

— e 2 ' 

— e 3 ‘ 

- -2e 5 ‘ 


<t>(t)=\-e 2t —2e 3t —6e 5t 

\-e 2t -e 3t -2e 5t ) 

is such a fundamental matrix. After performing the required calculations (see Exer- 
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rise 1(a) at the end of this section), we find 


<& -1 (t) = 



and hence 


4>- l (0) 


2 

-4 

1 


1 

-1 

0 


Thus, using (11.29), the solution of the problem is 


= 


. 2 1 


\-e 


„ 2 1 


-2e 

-e 3t 

„ 3t 


3 1 


= \-e* 


J2t 


2e 3t 

3t 


— e 



\- e 

2e 2 ' - 6e 3 ' + 3e 5 ' 

= | — 2e 2t + 12e 3 ' — 6e 5 ' 
— 2e 2 ' + 6e 3 ' — 2e 5 ' 


2 

-4 

li 



We write this as 


= 2e 2t - 6e 3t + 3e 5t , 
x 2 = —2e 2t + \2e 3t — 6e 5t , 
Xi = — 2e 2t + 6e 3t — 2e 5 \ 


(11.30) 


We should note that the determination of 0 -1 (t) requires some calculations (we 
again refer to the aforementioned Exercise 1(a)). For this reason, the following 
alternate procedure for solving the problem is preferable. 

In Example 11.8, we noted that the general solution of the differential equation 
(1 1.10) can be written as 

x i = c i e2t + c 2 e 3t + 3 c 3 e 5t , 

x 2 = — c x e 2r — 2 c 2 e 3t — 6 c 3 e St , (11.31) 

x 3 = — c x e 2t — c 2 e 3t — 2 c 3 e 5t . 


In terms of components, the given initial condition (1 1.28) is 
Xx(0) = — 1, x 2 (0) = 4, x 3 (0) = 2. 
Applying these conditions to (11.31) we at once have 

c \ + c 2 + 3c 3 = — 1, 

— c l — 2c 2 — 6 c 3 = 4, 

-Ci - c 2 - 2c 3 = 2. 


(11.32) 
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This system of three linear algebraic equations in the three unknowns c l9 c 2 ,c 3 has the 
unique solution 

Ci = 2, c 2 = - 6, c 3 = 1. 

Substituting these values back into (1 1.31) we again obtain the unique solution (1 1.30) 
of the given initial-value problem. 

Comparing this alternate procedure for solving the given initial-value problem with 
that which uses formula (1 1.29) we note that obtaining the solution of the system (1 1.32) 
also requires some calculations (we here refer to Exercise 1(b) at the end of this 
section). Nonetheless, the alternate procedure seems more practical. 


THEOREM 11.11 

If the n vector functions 


<!>* = 


/ 0i*\ 

02 k 


\0mi 


(k= 1 , 2 ,..., n), 


satisfy the homogeneous linear vector differential equation 

dx 


dt 


= A(t)x 


on the real interval [a, b], then the matrix function 


a> = 


011 012 
021 022 


01m 

02m 


is such that 


\0m1 0m2 0mm j 


dt 


= A(t)«D( t ) 


(11.15) 


(11.33) 


for all t e [a, b]; and conversely. 

In particular, if O is a fundamental matrix of (11.15) on [a, b], then <!> satisfies (11.33) 
on [a, b]. 

Proof. Since <(>* satisfies dx/dt = A(r)x on [a, b], we have <|>t(t) = A(t)<J>*(t) for all 
t e[a,b](k = 1,2 ,...,«). That is, 



1 

4>'jkU) 

= 


\ 


aji(t) a j2 (t ) ••• a Jn (t ) 


M)\ 

<M0 


0Mfc(O/ 
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and hence 

n 

4>'jk(t ) = X 

1=1 

for all t g [a, b] (j = 1, 2, . . . , n; k = 1,2,..., n). Now observe that this derivative (j>' jk (t) 
is the element in jth row and fcth column of the derivative <!>'(£) of <D(t). On the other 
hand, we see that the element in the j th row and the fcth column of the product 

/«n(0 «i 2(0 a ln (t) 

A(f)<D(0= a n (t) a j2 (t) ••• a Jn (t) 

\«.l(0 «„2 (0 «n«(0 

is also 

n 

i= 1 

for all r g [a, b] (j = 1, 2 ,...,n;k = 1,2,..., n). Therefore, we have 

®'(t) = A(t)<D(t) 

for all t e [a, &]. 

Conversely, suppose <D'(f) = A(t)®(t) for all t e [a, b]. Then equating correspond- 
ing columns of ®'(0 and A(t)<D(t), we at once have 

«K(0 = A(t)<M0, 

t e [a, b] (k = 1 , 2, ... , n). Thus each <J> t (k = 1,2 ,...,«) satisfies differential equation 
(11.15) on [a, b]. 

Finally, the statement concerning a fundamental matrix of (11.15) is now obvious 

Q.E.D. 

► Example 11.11 


0n (0 ••• <M0 ••• 

0 2l(0 02k(O 02nW 

««l(f) ••• 0»»(t) ••• 0«»(O 


Referring back to Example 11.10 and applying Theoremll.il, we see that the 
fundamental matrix 


<D(t) = 


e 2t 

e 3 ' 

3e 5t 

— e 2t 

— 2e 3 ' 

1 

ON 

-e 2t 

-e 3 ' 

— 2c 5 ' 


of the differential equation 



is such that 


m o 

dt 


/ 7 - 1 6\ 

-10 4 -12 ®(t) 


( 11 . 10 ) 
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for all real t. That is, for all real r, we have 


^ 2e 2 ‘ 

3e 3 ‘ 

15e 5, \ 


1 1 

-1 



( e 2t 

e 3t 

3e 5, \ 

— 2e 2t 

—6e 3 ‘ 

1 

U> 

o 

= 

-io 

4 

-12 


— e 2t 

-2e 3t 

—6e 5 ‘ 

y—2e 2t 

— 3e 3 ' 

-10 e 5, j 


1-2 

1 



{-e 2t 

-e 3 ' 

— 2e 5, J 


The student should carry out the indicated multiplication in the right member and thus 
directly verify the validity of this equality. 


DEFINITION 


n 

The trace of ann x n matrix A is given by the formula tr A = a n . That is, the trace of 

j=! 


A is the sum of its main diagonal elements. 


THEOREM 11.12 Abel-Liouville Formula 

Let the functions 


4>k = 


/0u\ 

02fc 


\ 0 , 


, (k = 1, 2,...,n), 


nk f 


be n solutions of the homogeneous linear vector differential equation 

dx 

T, ~ A(,)x 


(11.15) 


on the real interval [a, b]; let t 0 be any point of [a, b~\: and let W denote the Wronskian of 
(t) u (t) 2 ,-.-,<P„. Then 


W(t) = W / (t 0 )exp 


f 

Jto 


tr A (s)ds 


(11.34) 


for all t e [a, b\ 

Proof. We differentiate the Wronskian determinant 

011 012 01/i 

021 022 02/i 

0i 1 0i2 0i n 

0/i 1 0/i2 0/i/i 





11.3 
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to obtain ,, 

011 012 

021 022 

w '= i i 
0*1 01 2 

0/i 1 0/i2 

011 012 
021 022 

0/1 0/2 

0/i 1 0/i2 


0i/i 


011 

012 


<Pln 


02»i 


021 

022 


02 n 


0i/i 

+ 

0/1 

0/2 


4>in 

+ ... 

0/1/1 


0/, 1 

0/i 2 


<Pnn 


01/, 



011 

012 


<Pln 

02/i 



021 

022 


(p2n 

0 i/i 

+ ' 

... + 

0*1 

0/2 


(Pin 


0/./1 I 0/i 1 0/i2 ••• 0in 


(11.35) 


Here, and throughout the rest of the proof, primes denote derivatives with respect to t. 
Thus IT' is the sum of n determinants, in each of which the elements of precisely one 
row are differentiated. Since <|> fc , (fc = 1, 2,..., n), satisfies the vector differential 

n 

equation (11.15), we have ^ = A^, (fc = 1, 2,..., n), and so 4>' ik = £ 

(i = 1, 2, ..., n; j = 1, 2,..., w). Substitute these for the indicated derivatives in each 
of the n determinants in the preceding expression (1 1.35) for IT'. Then the ith deter- 
minant, (i = 1, 2,..., n) in (11.35) becomes 


011 

012 

01/i 


0n 

012 

<Pln 

021 

022 * 

02/i 


021 

022 

02. 





n 

n 

rt 

0/1 

0/2 

‘ • 0/n 


Z a u(pj i 

Z a ij(pj 2 

•" Z a ij(pjn 





1=1 

J=1 

J=l 

0/.1 

0/,2 “ 

* 0/1/1 


0.1 

0/,2 

0/i/i 


Writing out each of the indicated sums in the preceding determinant and using 
fundamental properties of determinants, we see that it breaks up into the following sum 
of n determinants: 



011 

012 

0i/i 


0n 

012 


(Pin 



021 

022 

02/i 


021 

022 


(p2n 


fl/i 

011 

012 

‘ 01/, 

+ a i2 

021 

022 


<P 2. 

+ ... 


0.1 

0/i2 

‘ 0/1/1 


0/i 1 

0/i 2 


(Pnn 



0n 

012 •* 

“ 0i/i 



011 

012 


(P In 


021 

022 

* 02/, 



021 

022 


(p2n 

+ a ii 

0; i 

0/2 ' ' 


+ ••• • 

T 






‘ 0/n 


0/.1 

0/i2 


4>nn 


0.i 

0/i2 

0/i/i 



0/11 

0/i2 


(Pnn 
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Each of these n determinants has two equal rows, except the ith one, and the coefficient 
of this exceptional one is a H . Since a determinant having two equal rows is zero, this 
leaves only the single exceptional determinant having the coefficient a i{ . Thus we have 


for each i = 1 , 2 ,..., n. 


Using this identity with i = 1, 2, . . . , n, we replace each of the n determinants in (1 1.35) 
accordingly. Doing so, (1 1.35) then takes the form 


0n 

012 

' 01. 


0n 

012 

0i. 

021 

022 ’ 

02. 

ci a 

021 

022 

• 02. 

0a 

0i2 - 

•• 01. 

0a 

0i2 

' • 0in 

0.i 

0.2 

• 0.. 


0.1 

0n2 

‘ 0.. 


W' = a n 


+ d:: 


011 012 
021 022 

0,1 0i2 

0.1 0.2 

011 012 
021 022 

0.1 0i2 

0nl 0.2 


01 . 

02 . 

4>t„ 

0.. 

01n 

02 . 

0i. 

0n. 


+ «22 


011 012 
021 022 

0i 1 0i2 • 

0.1 0.2 


0 In 

02 . 

0in 

0nn 


+ * * * + 


011 012 
02 1 022 

0;i 0,2 

0nl 0n2 


01 . 

02 n 

0fn 

0 tin 


That is, W" 


u=> J 


and so 


tT' = (tr A)W 


(11.36) 


That is, (11.36) states that IT satisfies the first-order scalar homogeneous linear dif- 
ferential equation 

<W(t) 


dr 


= [tr A(r)]lT(r). 


Integrating this, we at once obtain 

IT(r) = c exp 


f 

Jlo 


tr A(s) ds 


Letting t = t 0 , we find that c = H’(/ 0 ). and hence we obtain the stated Abel-Liouville 
formula 


IT(r) = IT(to) exp tr A(s) ds 


j* tr A(s) dsj . 


(11.34) 
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Exercises 

1 . This exercise deals with the verification of certain algebraic calculations referred to 
in Example 11.10. 

(a) Find the inverse of the matrix 

^ e 2t e 3t 3e 5t ^ 

— e 2t — 2e 3t —6e 5t . 

— e 2t — e 3t —2 e 5t i 


(b) Find the solution of the linear algebraic system 

c x + c 2 + 3 c 2 = - 1, 
-Ci - 2 c 2 - 6 c 3 = 4, 

Ci c 2 2c 3 — 2. 

2. (a) Show that 


is a fundamental matrix of the linear system 


6 -1 


(b) Use Formula (1 1.27) of Theorem 11.10 to find the unique solution <|> of this 
linear system that satisfies the initial condition 


+( 0 ) = 


(c) Verify directly that O defined in part (a) satisfies Formula (1 1.33) of Theorem 

11 . 11 . 


3. (a) Show that 


5e 3t -3e- 


is a fundamental matrix of the linear system 

dx l— 2 l\ 


(b) Use Formula (1 1.27) of Theorem 1 1.10 to find the unique solution <|> of this 
linear system that satisfies the initial condition 


(c) Verify directly that <J> defined in part (a) satisfies Formula (1 1.33) of Theorem 

11 . 11 . 
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4. (a) Show that 


0(0 


— 2e x 2e 5t 


is a fundamental matrix of the linear system 


dx 

It 


3 1 

4 3 


x. 


(b) Use Formula (1 1.27) of Theorem 1 1.10 to find the unique solution <|> of this 
linear system that satisfies the initial condition 


m = 


(c) Verify directly that ® defined in part (a) satisfies Formula (1 1.33) of Theorem 

11 . 11 . 

5. (a) Show that 



lie' 

e 2 ' 0 ' 

II 

e 

2e‘ 

2e 2t 3e 5t 


\ e ‘ 

e 2t e 5t / 


is a fundamental matrix of the linear system 

0 -2 




(b) Use Formula (1 1.27) of Theorem 1 1.10 to find the unique solution <|> of this 
linear system that satisfies the initial condition 



(c) Verify directly that O defined in part (a) satisfies Formula (1 1.33) of Theorem 

11 . 11 . 


6. (a) Show that 


( t 


®(0 = 


„3t 


„3 1 


e e 
3e‘ 2e 3t 


-2 1 


~2 1 


is a fundamental matrix of the linear system 

Hi:!!)- 
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(b) Use Formula (1 1.27) of Theorem 1 1.10 to find the unique solution <)> of this 
linear system that satisfies the initial condition 


(c) 


<K0) = 


1\ 
0 . 

W 


Verify directly that <J> defined in part (a) satisfies Formula (1 1.33) of Theorem 

11 . 11 . 


7. Let be a fundamental matrix of dx/dt = A (t)x on [a, 6], and let C be a constant 
nonsingular matrix. 

(a) Show that <DC is also a fundamental matrix of dx/dt = A (t)x on [a, b\ 

(b) Show that in general C<l> is not a fundamental matrix of dx/dt = A (t)x on 

[a, b]. 

(c) Determine B such that C<I> is a fundamental matrix of dx/dt = B(t)x on [ a , b~\. 


11.4 THE NONHOMOGENEOUS LINEAR SYSTEM 

We return briefly to the nonhomogeneous linear vector differential equation 

^ = A(t)x + F(t), (11.6) 

where A (t) is given by (1 1.4) and F (t) and x are given by (1 1.5). We shall see the solutions 
of this nonhomogeneous equation are closely related to those of the corresponding 
homogeneous equation 

dx 

— = A(r)x. (11.15) 


THEOREM 11.13 

Let <|> 0 be any solution of the nonhomogeneous linear vector differential equation 

dx 

— = A(t)x + F(t); (11.6) 

let <)>!, <J> 2 ? • . . , <(>„ be a fundamental set of solutions of the corresponding homogeneous 
differential equation 

dx 

— = A(r)x; (11.15) 


n 

* 0+1 c k$k 


k= 1 


and let c l ,c 2 ,... 9 c n be n numbers. 
Then: (1) the vector function 


(11.37) 
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is also a solution of the nonhomogeneous differential equation {11.6) for every choice of 
c 1 > ^ 2 > • • • > C n > and 

(2) an arbitrary solution cj) of the nonhomogeneous differential equation {11.6) is of the 
form {11.37) for a suitable choice of c i9 c 2 ,..., c n . 

Proof. (1) We show that (11.37) satisfies (11.6) for all choices of c l9 c 2 ,..., c n . We 
have 

aH- [.?, •H- 

Now since <J> 0 satisfies (1 1.6), we have 

= A(r)4»o(0 + F(t); 

and since by Theorem 11.2, £j =1 c t <j) k satisfies (1 1.15), we also have 

j t j^Z c fc4>*(oJ- 

Thus 

J t [<M0 + i c*4*(t)] = A(t)4oW + F(0 + A (t)[i c 4 4*(t)J 
= A(t) [ m + i c k 4» k (t)J + F(t). 

That is, 

d^f{t) 

= a ( tm ) + fw 

where 

n 

'I' = ^0 + Z 

fc = l 

and so 

n 

♦ = 4>o + E 

fc = l 

is a solution of (11.6) for every choice of c l9 c 2 ,..., c„. 

(2) Now consider an arbitrary solution of (1 1.6) and evaluate the derivative of the 
difference <t» - <t»o- We have 

Since both <|> and <|> 0 satisfy (1 1.6) we have respectively 

^ = A(t)4>(r) + F(t), 

= A(t)<J>o(t) + F(t). 
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Thus we obtain 

j t MHO - 4>o(01 = [A(04>(0 + F(t)] - [A(t)<j> o (0 + F(t)], 
which at once reduces to 

j t [<t>(0 - <M0] = A(0MH0 - 4»o(t)3 

Thus <t> — <|>o satisfies the homogeneous differential equation (1 1.15). Hence by Theorem 
1 1.8, there exists a suitable choice of numbers c l9 c 2 , . . . , c n such that 

n 

<l> — <t>o = Z 

k = 1 

Thus the arbitrary solution c(> of (1 1.6) is of the form 

4 = <l>o + t c*4>* (11.37) 

k = 1 

for a suitable choice of c l9 c 2 , . . . , c n . Q.E.D. 


DEFINITION 

Consider the nonhomo geneous linear vector differential equation (11.6) and the 
corresponding homogeneous linear vector differential equation (11.15). By a general 
solution of (11.6), we mean a solution of the form 

Cibi + c 2 ^ 2 + ••• + c n $n + <l>o> 

where c i ,c 2 ,...,c n are n arbitrary numbers , a fundamental set of 

solutions of (11.15), and c|> 0 is any solution of (11.6). 


► Example 11.12 


Consider the nonhomogeneous differential equation 


7 

-1 

6 1 


/ — 5t — 6 \ 

-10 

4 

-12 

X + 

-At + 23 

l- 2 

1 

-'1 


l 2 / 


and the corresponding homogeneous differential equation 


7 

-10 

-1 

4 

6 

-12 

X, 

fxA 

where x = x 2 \ 

I ' 2 

1 

- 1 


w 


(11.3) 


( 11 . 2 ) 


These were introduced in Example 11.1 where they were written out in component 
form; and (1 1.2) has been used in Example 11.8 and several other examples as well. In 
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Example 1 1.8 we observed that <j> 1( <(> 2 , <t> 3 defined respectively by 

<M0 


l e2t ) 

-e 2 ' 

5 

= 

e 3 ' ^ 
— 2e 3 ' 

, and <(> 3 (0 = 

^ 3e 5t ^ 
—6e 5 ' 

\-e 2 'j 



-‘“1 


[~ 2 H 


( 11 . 22 ) 


form a fundamental set of the homogeneous differential equation (1 1.2) [or (1 1.10) as it 
is numbered there]. Now observe that the vector function <(> 0 defined by 


<M £ ) = 


2t \ 

3t-2 

\~ t+l l 


is a solution of the nonhomogeneous differential equation (11.3). Thus a general 
solution of (1 1.3) is given by 


X = C!<Mt) + C 2 <M0 + C 3 <t> 3 (t) + <t>o(0> 


that is, 



( e2 ‘ \ 

e 3 ' \ 

X = C l 

-e 2 ‘ \+c 2 

— 2e 3 ' 


l-W 

{-e 2 ‘j 




where c u c 2 , and c 3 are arbitrary numbers. Thus a general solution of (11.3) can be 
written as 


*i = c i e2t + c 2 e3t + 3c 3 e 5t + 2 1, 

x 2 = —c l e 2t — 2 c 2 e 3t — 6 c 3 e 5t + 3f — 2, 


x 3 = —c 1 e 2t — c 2 e 3t — 2c 3 e 5r — t + 1, 


where c l5 c 2 , and c 3 are arbitrary numbers. 


We now develop a systematic procedure for finding a solution of the nonhomo- 
geneous linear vector differential equation 

dx 

— = A(t)x + F(t), (11.6) 

assuming that we know a fundamental matrix of the corresponding homogeneous 
linear vector differential equation 

~ = A(t)x. (11.15) 

Specifically, let <I>(t) be a fundamental matrix of (11.15) on [ a , 6]. Then by the note 
immediately following Theorem 1 1.9, the general solution of (1 1.15) can be expressed in 
the form 


x(f) = <D(f)c, (11.38) 

where c is an arbitrary n-rowed constant vector. We shall now proceed as we did in the 
analogous case of the single nth-order nonhomogeneous linear differential equation in 
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Section 4.4, that is, by variation of parameters. We thus replace the constant vector c in 
(11.38) by the vector function v(£), thereby obtaining 

x(t) = <D(t)v(t), (11.39) 

where \(t) will be determined so that (1 1.39) will be a solution of the nonhomogeneous 
differential equation (1 1.6). 

Differentiating (1 1.39), we obtain 

x'(£) = 0>'(£)v(£) + <J>(£)v'(£); (1 1.40) 

and substituting (1 1.39) and (1 1.40) into (1 1.6) we find 

<b'(t)y(t) + d y(t)y'(t) = A(r)[<D(r)v(t)] + F (t). (1 1.41) 

Since Q>(t) is a fundamental matrix for the homogeneous differential equation (11.15), 
by Theorem 11.11, <&'(£) = A(t)<b(t). Hence <S>'(t)y(t) = A(£)<D(£)v(£), and so (11.41) 
reduces to 


<* = f (r). 

Since <b(t) is a fundamental matrix of (1 1.6), by Theorem 1 1.6 we know that \Q>(t)\ ^ 0 
on [a, b] and hence <I>" l (t) exists and is unique on this interval. Hence, we at once have 

v'(£) = <D- 1 (£)F(£); 

and integrating, we obtain 

y(t)= I <S>~ l (u)F(u) du, 

Jto 

where t 0 and t e [ a , b~\. Thus a solution of the assumed form (1 1.39) is given by 

x(t) = Q>(t) | Q>- l (u)F(u) du. 

Jto 

We shall now show that every function of this form is indeed a solution of the 
nonhomogeneous differential equation (1 1.6). 


THEOREM 11.14 

Let be a fundamental matrix of the homogeneous linear vector differential equation 
dx/dt = A (£)x on [a, b~\. Then <D 0 defined by 

<|> 0 (£) = m Q>~ l (u)F(u) du, (11.42) 

Jto 

where t 0 e [a, b\ is a solution of the nonhomogeneous linear vector differential equation 


on [a, b~\. 


— = A(t)x + F(t) 


( 11 . 6 ) 
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Proof. Differentiating ( 1 1 .42), we find 


4 >o(0 = *'(t) <D" 1 (m)F(m) du + <S>(t)<S>~ l (t)F(t) 

Jto 

= <D'(t) <S>~ l (u)F(u)du + F(t). 

Jto 

Since is a fundamental matrix of the homogeneous differential equation 
dx/dt — A (r)x, by Theorem 11.11 we have <D'(r) = A(r)d>(f). Thus we find that 


♦o(t) = A(f)«D(t) <D“‘(u)F(u) du + F(r). 


Also from (1 1.42), we find 


(11.43) 


A(f)4> 0 (0 + F(t) = A(r)<D(t) I <t>~ l (u)F(u) du + F(t). (1 1.44) 

J<0 

Comparing (1 1.43) and (1 1.44), we see that <(> 0 (r) defined by (1 1.42) is such that 

4»o(0 = A(t)<M0 + F (r); 

that is, every function <|> 0 (f) defined by (1 1.42) satisfies the nonhomogeneous differential 
equation (11.6). Q.E.D. 


► Example 11.13 


Consider the nonhomogeneous differential equation 

dx (6 — 3\ fe 5 ' 

*-(2 iJ x+ U 

Clearly this is of the form (1 1.6), where 

*<'>-(2 "?)• w-OO' ■ 


The corresponding homogeneous differential equation is 

dx /6 — 3\ 


dt \2 1 


(11.45) 


(11.46) 


This is the system (7.79) of Example 7.17 (in different notation). In that example, we 
found that 


<M0 = (O and <M0 = ( 2^4, 


(11.47) 


constitute a fundamental set (pair of linearly independent solutions) of (1 1.46). Thus a 
fundamental matrix of (1 1.46) is given by 


e 3 ‘ 3e 4 ‘ 
e 3 ‘ 2e 4t 


(11.48) 
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By Theorem 1 1.14 we know that 

4>o(t) = *(r) I ® _1 (u)F(u) du 

J lo 

is a solution of (1 1.45) for every real number t 0 . For convenience, we choose t 0 — 0. We 
thus proceed to evaluate 


<mo = m 


<D _1 (u)F(u) du, 


(11.49) 


where <D(f) is given by (1 1.48) and F(u) = ( ” ). We find that 


,5a 


<l> _1 (u) = 


-2e 

„-4 a 


3u 3e -3u 
-4e 


Thus (1 1.49) becomes 
4>o(t) = 


,3l 


,3< 


C 3 ‘ 

C 3 ' 

2e 5 ' 


3c 4 ' 

2e 

3e 

2e 

3e 4 ' 

2e 4 ' 


— 2e 

— 4 u 


-)£( 


-3 u 


3e~ 3u 

-4a 


e —e 

2e 2u + 1 2e ~ 3 " 
- 4e~ 4u 

— e 2 ' — 4e~ 3 ' + 5 
e' + e~ 4, -2 


du 


du 


1 + 5e 3 ' - 6e 4 ' 
e 5t -2 + 5e 3t -4 e M )' 

This, then, is a solution of the nonhomogeneous differential equation (11.45). In 
component form it is 

5e 3t - 6e At + 2e 5t - 1, 

5e 3 ' - 4e* + <? 5 ' - 2. 


*2 


Observe that it can also be expressed in the form 

4>o(t) = 5<M0 - 2<M0 + M), (11-50) 

where and <J> 2 are the fundamental set of homogeneous differential equation (1 1.46) 
defined by (1 1.47) and c| )$(t) is given by 

'2e 5t -V 

e 5t — 2 


4 > 5(0 


The reader should verify that both c)x 0 ( t ) and <|>g(t) satisfy the nonhomogeneous 
differential equation (11.45) and he should then observe how (11.50) thus illustrates 
Theorem 11.13 conclusion (1). 


THEOREM 11.15 


The unique solution c|> of the nonhomogeneous linear vector differential equation 
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that satisfies the initial condition 


Wo) = x o> 

where t 0 e [a, b\ can be expressed in the form 


d> 1 (u)F(u) du. 


(11.51) 


(11.52) 


where Q>(t) is an arbitrary fundamental matrix of the corresponding homogeneous vector 
differential equation 


dx 

* = awx 


(11.15) 


Proof. By Theorem 11.13, Conclusion (2), the solution <|> can be expressed in the 
form 


W) = I cA(t) + i(ft (T 1.53) 

k = 1 

where the c k are suitably chosen constants, the are a fundamental set of (11.15), 
and <|> 0 is any solution of (11.6). The proof of Theorem 11.9 shows that the linear 
combination ]Tj =1 c k ty k in (11.53) can be written as Oc, where <I> is the fundamental 
matrix of (11.15) having the ty k as its individual columns. Also, by Theorem 11.14 a 
solution <|> 0 of (1 1.6) is given by 

<M0 = W) P <t>~ l (u)F(u)du, 

Jto 

where is this same fundamental matrix of ( 1 1 . 1 5). Thus the solution <|> given by ( 1 1 .53) 
takes the form 


<|>(r) = <D(*)c + 4>(f) I G>~ l (u)F(u)du. (11.54) 

Jto 

Now apply the initial condition <|>(T 0 ) = x 0 , Equation (1 1.51). Letting t = t 0 in (1 1.54) 
we obtain 


Wo) = Wo)c + 0 

(since the integral from t 0 to t 0 is zero); and hence application of the initial condition 
gives 

*0 = Wo)c. 

Since |O(t 0 )| # 0, <D“ l {t 0 ) exists and we find 

O " 1 (t 0 )x 0 = <D “ 1 (r 0 ) Wo)c = Ic = c. 

Thus substituting this value of c back into (1 1.54) we obtain the solution in the form 

<D -1 (m)F(w) du . 


cKt) = d>(t)<I>- 1 (to)Xo + W) 


(11.52) 

Q.E.D. 
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► Example 11.14 


Find the unique solution <|> of the nonhomogeneous differential equation 



that satisfies the initial condition 



(11.45) 


(11.55) 


By Theorem 1 1.15, we know that the desired solution can be expressed in the form 
(11.52), where <D(t) is a fundamental matrix of the corresponding homogeneous 
differential equation 


dx 

dt 



(11.46) 


and where 



and 



From Example 1 1.13 we know that a fundamental matrix of (1 1.46) is given by 


<D(t) = 

In that example we also found that 



3c 4 ' \ 
2e 4 '/ 


d>- 1 (t) = 



3c- 3 ' \ 
-c- 4 'J’ 


and from this we at once obtain 


<D _1 (0) = 




(11.48) 


Hence, applying (11.52) with these assorted matrices, vectors, and values, we see that 
the desired solution is given by 


<K0 = 



3e 4 '\ 

2c 4 'J 


-2 3\/9\ /e 3 ' 3e 4 '\ f' / — 2c _3 “ 

1 -7W + U 3 ' 2e / Jo \ 



(11.56) 


The first term on the right quickly reduces to 


/c 3 ' 3e 4, \/-6\ _ /-6c 3 ' + 15c 4 '\ 
\e 3 ' 2c 4 7V 5/ ~ V-6c 3 ' + 10e 4 7 


(11.57) 


The second term on the right is precisely the solution <|> 0 (t) of Example 11.13, and this 
was evaluated there and found to be 


5c 3 ' - 6e 4 ' + 2c 5 ' - 1 
5c 3 ' — 4c 4 ' + c 5 ' — 2 


(11.58) 
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Adding these two constituent parts (11.57) and (11.58) of (11.56), we find the desired 
solution 


m = 


— e 3t + 9e 4t + 2e 5t — 1\ 

— e 3t + 6e 4t + e 5t — 2) 


In component form this is 


x t = —e 3t + 9e 4t + 2e 5t — 1, 
x 2 = —e 3t + 6e 4t + e 5t — 2. 


Exercises 


For each of the nonhomogeneous linear systems in Exercises 1-6, 

(a) find a fundamental matrix of the corresponding homogeneous system, and 

(b) find a solution of the given nonhomogeneous system. 


1 . 

2 . 

3. 

4. 

5. 

6 . 



dx /3 1\ /— 2 sin t \ 

dt \4 3/ + \ 6 cos t) 

dx /l — 5\ / 5 cos 2A 

~di = \ 2 — iy X + V — 5 sin /' 


7. Find the unique solution <|> of the nonhomogeneous linear system 


dx / — 10 6\ /10c 3t \ 

~di = \-n 7y X + \18e“ 3 7' 


that satisfies the initial condition cj)(0) = 



8. Find the unique solution cj) of the nonhomogeneous linear system 


dx ( -1 \\ {3e 4t \ 

dt ~\- 12 6/ + \8e 4 '/ 


that satisfies the initial condition <(>(0) 


2 ' 

4 
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11.5 BASIC THEORY OF THE nth-ORDER HOMOGENEOUS LINEAR 
DIFFERENTIAL EQUATION 


A. Fundamental Results 


Throughout Sections 11.5 and 11.6 we shall be concerned with the single nth-order 
homogeneous linear scalar differential equation 


d"x d n ~ l 


x , x dx / v 

r + ' ' ' + a n- 1 (0 — + a»(t)x = 0, 


dt 


(11.59) 


where a 0 , a t ,..., a K - lt a n are continuous on an interval a <t <b and a 0 (t) ^ 0 on 
a < t < b. Let L„ denote the formal nth-order linear differential operator defined by 


d" d"~ 

L " = a ° (t) dP i + ai(t) dt ;r - 


T + ' " + a n- 1 (0 ^ + Cl n( ! )- 


(11.60) 


Then differential equation (1 1.59) may be written 

L n x = 0. 

If we divide through by a 0 (t) on a < t < b, we obtain the equation 

d"x a^d'-'x a,-i (t) dx a.(t) 

dt" a 0 (t) dt"' 1 a 0 (t) dt a 0 (t) 


(11.61) 


(11.62) 


d n x 

This equation, in which the coefficient of is 1, is said to be normalized. 

The basic existence theorem for the initial-value problem associated with the nth- 
order homogeneous linear scalar differential equation (11.59) is the following special 
case of Theorem 10.8. 


THEOREM 11.16 
Hypothesis 

1. Consider the differential equation 

d n x d n ~^x dx 

a o (0 -jpt + «i(0 -jprr + "• + «»- 1(0 + a n(t) x = o, (l 1.59) 

where a 0 , a u . . . , l5 and a„ are continuous on the interval a < t < b, and a 0 (t) # 0 on 
a < t < b. 

2. Let t 0 be a point of the interval a < t < b, and let c 0 , c l9 . ... , c„_ i be a set of n real 
constants. 

Conclusion. There exists a unique solution (f) of (11.59) such that 

<p(t 0 ) = c 0 , <p'(t 0 ) = c l ,...,<p ( "- i) (t 0 ) = c„- 1 , (11.63) 

and this solution is defined over the entire interval a < t < b. 
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The following important corollary to this theorem will soon be very useful to us. 


COROLLARY 

Hypothesis. The function 0 is a solution of the homogeneous equation (11.59) such 
that 

<K* o) = 0, 0'(f o ) = 0, . . . , (t> in - l \t 0 ) = 0, (1 1.64) 

where t 0 is a point of an interval a < t < b on which the coefficients a 0 ,a l9 ...,a n are all 
continuous and a 0 (t) # 0. 

Conclusion. 0(t) = 0 for all t such that a < t < b. 


Proof. First note that 0 such that 0(t) = 0 for all t e [a, b~\ is indeed a solution of 
the differential equation (11.59) which satisfies the initial conditions (11.64). But by 
Theorem 1 1.16 the initial-value problem composed of Equation (1 1.59) and conditions 
(11.64) has a unique solution on a < t < b. Hence the stated conclusion follows. 


In the statements of this theorem and corollary we have explicitly stated the 
hypothesis that the coefficients a 0 , a x , . . . , a n are all continuous on the interval a <t <b 
under consideration and that a 0 (t) # 0 there. In the remainder of this section and 
in Section 11.6, we shall always make this assumption whenever we refer to the 
homogeneous linear differential equation (11.59). That is, we always assume that in 
homogeneous linear differential equation (11.59) the coefficients a 0 , a t ,..., a n are all 
continuous on the interval a < t < b under consideration and that the leading coefficient 
a 0 (t) # 0 on this interval. 

In the proofs of Theorem 10.6 and 10.8 we noted that a single nth-order differential 
equation is closely related to a certain system of n first-order differential equations. 
We now investigate this relationship more carefully in the case of the nth-order 
homogeneous linear scalar differential equation 


d n x d n 

«*>*+*.»*= 


; dx 

-+••• + a n - ft) — + a n (t)x = 0. 


(11.59) 


Let 


x i = x 


x 2 


dx 

dt 


*3 


d 2 x 


x 


n- 1 


d n 2 x 
dt n ~ 2 


d n l x 


(11.65) 




dt n ~ l 
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Differentiating (1 1.65), we have 

dx dx i d 2 x dx 2 d n ~ l x dx n - t d n x dx n 
~di = ~di ’ ~dt 2= ~dt' > "" dt n ~ l= dt ’ ~dF = ~di • 


The first n — 1 equations of (1 1.66) and the last n — 1 equations of (1 1.65) at once give 


dx t dx 2 dx n _ i 

— = x 2 , — = x 3 ,..„ — = x„. 

Now assuming a 0 (t) # 0 on a < t < b, (1 1.59) is equivalent to 

d n x a n (t) cin-^dx a l (t)d n ~ l x 

dt n a 0 (t) a 0 (t) dt a 0 (t) dt n ~ l 

Using both (1 1.65) and (1 1.66), this becomes 

dx n _ a n (t) a n . t (t) a x (t) 

dt «o(0 «o(0 X ”‘ 

Combining (1 1.67) and (1 1.68), we have 

dx x _ 
dt ~ Xl 

dx 2 

-1F = X > 


dX *~l - y 

dt 

dt a 0 (t) X ' a 0 (t) Xz a 0 (t) 


(11.67) 


( 11 . 68 ) 


(11.69) 


This is a special homogeneous linear system of the form (1 1.1). In vector notation, it 
is the homogeneous linear vector differential equation 

dx 

— = A(l)x, (11.70) 

where 





1 0 

1 

0 

• 0 

0 


l x l\ 


1 0 

0 

1 

• 0 

0 


x 2 







X = 


and A = 

0 

0 

0 

0 

1 


\ X J 


1 

a n- 1 

0»- 2 

_02 

_01_ 




\ 00 

0 0 

a 0 

0Q 

0 0/ 


The equation (11.70) is sometimes called the companion vector equation of nth-order 
scalar equation (1 1.59). 
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Now suppose / satisfies the nth-order homogeneous linear differential equation 
(11.59). Then 

a 0 (t)f (n) (t) + fl,(0/ < " -1) (0 + ••• + + a„(t)f(t) = 0, (11.72) 


for t e [a, b]. Consider the vector <j> defined by 


0 1 (t ) \ 


/0(f) 

02(f) 


/'(t) 

03(f) 


/"(f) 

0»-i(f)j 


,/ ( »- 2) (t) 

\0„(t) / 


\/ ( "“ 1, (f)/ 


From (1 1.72) and (1 1.73) we see at once that 


(11.73) 


4>\(t) = 0 2 (t), 

02(0 = 03(O> 


0»-l (0 = 0„(O, 


m = 


« n (0 

«o(0 


0i(O 


a 0 (t) 


02(f) -••• 


«l(f) 

«o(0 


0n(f )• 


(11.74) 


Comparing this with (11.69), we see that the vector c|> defined by (11.73) satisfies the 
system (11.69). 

Conversely, suppose 


wo = 


/0i( f)\ 
02(f) 

^0n(t) / 


satisfies system (1 1.69) on [a, b]. Then (1 1.74) holds for all t e [a, b], The first n — 1 
equations of (1 1.74) give 


02(f) = 01 (f ) 

0 3 (f) = 02(f) = 0i(t) (11-75) 


0,(0 = 0«-l(f) = 0n- 2 (t) = ••• = 0 1 l B ' 1, (f), 
and so 0'„(O = 0 l " ] (t). The last equation of (1 1.74) then becomes 


0 l r ] (f)=-^0 i(f)- a ' 


a 0 (f) 


«o(f) 


1(0 0 i (f ) — ?# 0r i, (t) 


«o(f) 


or 


a o (f)0 [ i B, (f) + fli(t)0 [ i" ll (f) + ••• + a„-i(f)0i(f) + a n(f)0i(t) = 0. 

Thus 0! is a solution / of the nth-order homogeneous linear differential equation 



11.5 
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(1 1.59) and moreover (1 1.75) shows that, in fact, 

//(o \ 

/'(f) 

m = /"(f) • 

\/ ,B “ 1) (t)/ 

We have thus obtained the following fundamental result. 

THEOREM 11.17 

Consider the nth-order homogeneous linear differential equation 

d"x d"~ l x dx 

a o( f ) dt” + + "‘ + + a »( t ) x = 


(11.59) 


and the corresponding homogeneous linear system (11.69). If f is a solution of (11.59) on 
[a, b], then 

If \ 

f 

*= /" (H.76) 

1>- 1 7 

is a solution of (11.69) on [a, b ]. Conversely , 1 / 


is a solution of (11.69) on [a, b], tbcn its first component (f) l is a solution f of (11.59) on 
[a, b] and c|> is, in /act, 0 / the form (11.76). 

The following corollary is an immediate consequence of the theorem. 


COROLLARY TO THEOREM 11.17 

If f is the solution of the nth-order homogeneous linear differential equation (11.59) on 
[a, b] satisfying the initial condition 

f(to) = c oi f'(t 0 ) = c i> /(^o) = / ( ” 1 \to) = c n-ii 


then 
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is the solution of corresponding homogeneous linear system (11.69) on [ a , b~\ satisfying the 
initial condition 


Conversely , if 


+(*o) = 



\ Cn l 


+ = 


/<M 

0. 2 


w 


is a solution of (11.69) on [a, fr] satisfying the initial condition 


0l(*o) — C 0 > 02 (^o) — C l5 03^o) - C 2 >-*-> 0»(*o) — 


rten is t/ze solution f of (11.59) on [a, b] satisfying the initial condition 

f(t 0 ) = c 0 , f'(t 0 ) = c u /"(to) = c 2 ,...,/ ,n “ 1) (fo) = c n - 1 . 

We shall now employ these stated relationships between the solutions of (1 1.59) and 
( 1 1 .69) to obtain results about the solutions of ( 1 1 .59) from the corresponding theorems 
about solutions of (11.69). We first obtain the following basic result. 


THEOREM 11.18 

A linear combination of m solutions of the nth-order homogeneous linear differential 
equation 

d n x d n dx 

a ° (0 + «l(0 ~fcn r T + "•+«»- l(0 + a n(t)x = 0 (1 1.59) 

is also a solution of (11.59). That is, if the functions f i , f m are m solutions of 

(11.59) and c l9 c m are m numbers, then the function 

m 

f = Z cj k 

k= 1 

is also a solution of (11.59). 

Proof. By hypothesis, are solutions of (11.59). Therefore by 

Theorem 11.17, 


//l \ 


//* 1 


IL 

f\ 1 

9 4^2 = 

f'l 

• 


fm 

uH 


i/r 1 ') 


V m 1 
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are solutions of homogeneous linear system (1 1.69). Then by Theorem 1 1.2, 

m 

«l> = Z C k$k 

k= 1 

is also a solution of (1 1.69). That is, 

£ cjk 

k= 1 
m 

Y c kf'k 

k= 1 

\i 

is a solution of (11.69). Then by the converse part of Theorem 1 1.18, the first 
component of this, 

m 

Y C kfk-> 

k= 1 

is a solution of (1 1.59). Q.E.D. 

We now show that the relationship between solutions of (1 1.59) and (1 1.69) preserves 
linear independence. We first review the definitions of linear dependence and 
independence for solutions of (1 1.59) and solutions of (1 1.69). 


B. Fundamental Sets of Solutions 
DEFINITION 

Them solutions , f 2 , . . . , f m of the nth-order homogeneous linear differential equation 
(11.59) are linearly dependent on a < t < b if and only if there exist m numbers 
c u c 2 ,..., c m , not all zero , such that 

C l/l (0 + £2/2(0 + ' ’ ’ + 0 

for all t such that a < t < b. 


DEFINITION 

The m solutions f x , f 2 , . . . , f m °f the nth-order homogeneous linear differential equation 
(11.59) are linearly independent on a < t < b if and only if they are not linearly 
dependent on the interval. That is, the solutions fi,f 2 >---’fm are linearly independent on 
a < t < b if and only if 

c ifi(t) + £ 2 / 2(0 + * ’ * + C mfm(t) = 0 
for all t such that a < t < b implies that 

Ci= c 2 = ... = Cm = 0 . 
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DEFINITION 

The m solutions cj)^ <|> m of the homogeneous linear system (11.69) are linearly 

dependent on a < t < b if there exists m numbers c 2 , . . . , c m , not all zero , such that 

Cl<M0 + C 2*M0 + “ * + = 0 

for all t such that a < t < b. 


DEFINITION 

The m solutions <J> l5 <|> 2 ,..., c|> m of the homogeneous linear system (11.69) are linearly 
independent on a < t < b if and only if they are not linearly dependent on this interval. 
That is , the solutions <(> 2 , . . . , <|> m are linearly independent on a < t < b if and only if 

the relation 


C 1<M0 + C 2<M0 + '" + ~ ® 

for all t such that a < t < b implies that 

Ci= c 2 = ... =Cm = o. 


THEOREM 11.19 

If the solutions / 1? f„ of the nth-order homogeneous linear differential equation 

(11.59) are linearly independent on a < t < b, then the corresponding solutions 





If* 


(fn \ 

<t>l = 

f'l 

> 2 — 

'f'2 

<1 >n = 

f'n 


\pr l) l 




\/H 


of the homogeneous linear system (11.69) are linearly independent on a < t <b; and 
conversely. 

Proof. If /i , / 2 , . . . , are linearly independent on a < t < b, then the relation 

cJdt) + c 2 f 2 (t) + --' + c n f n (t) = 0 (11.77) 

for all t e [a, b~\ implies that c Y = c 2 = * * * = c n = 0. Now suppose 

c i < MO + c 2 <t> 2 (0 + *“ + c„ty„(t) = 0 (11.78) 

for all t g [a, b]. The first scalar component relation of the vector relation (11.78) is 
precisely (11.77). So if (11.78) holds for all t g [a, b], so does (11.77), and this implies 


c t = c 2 = * * * = c„ = 0. Thus if (1 1.78) holds for all t g [a, b ] , then we must have c i = 
c 2 = • • • = c„ = 0; and therefore <|>i , c|) 2 , . . . , <J>„ are linearly independent on a < t < b. 


Now suppose /i, f 2 , f n are linearly dependent on a < t < b. Then there exist 
numbers c l9 c 2 ,..., c„, not all zero, such that 

+ ^ 2 / 2(0 + “ * + c n f n (t) = 0 


(11.79) 
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for all t g [ a , b~\. Differentiating this n — 1 times we obtain 

c i/i (0 + ^2X2(0 + “‘ + c n fn(t) = 

C l/l(0 + C 2/5(0 + “‘ + C nfn( t ) — 

. ( 11 . 80 ) 

cjr 1 i \t) + c 2 /r : 1) (o + ••• + c./?- 1 ^) = 0, 

all of which hold for all t e [a, 6]. Relations (11.79) and (11.80) together are n scalar 
relations which are equivalent to the single vector relation 

C 1<M0 + C 2<M0 + + C n$n( t ) = 0 (11.81) 


for all t g [a, b], Thus there exist numbers c 1? c 2 ,..., c„, not all zero, such that (11.81) 
holds for all t g [ a , b]; and so <(>!, <J> 2 , — , are linearly dependent on a < t < b. Thus 
the linear dependence of fi,f 2 ,...,f„ona<t< b implies that of on the 

same interval; and so the linear independence of <(>!, ••><!>„ on a < t < b implies 

that of / 1 , f n on this interval, which is the converse part of the theorem. 

Q.E.D. 


DEFINITION 

Let / 1? /„ be n real functions, each of which has an (n — 1 )st derivative on 

a < x < b. The determinant 

/1 f 2 f„ 

f\ f f 2 ••• /; 


l/r i) /r 1 * /f i, l 

w/icrc t/ic primes denote derivatives, is called the Wronskian of the n functions f u f 2 , . . . , 
/„. PTc denote it by W(f t , / 2 ,..., /„) and denote its value at t by W(f u f 2 ,...,f n ){t). 

Let / 1 , / 2 be n solutions of the nth-order homogeneous linear differential 

equation (1 1.59) on a < t < b, and let 


II 

\ 

. <t>2 = 

fl 

1 : 


If. \ 

fn 


^ . 
1 




[fH 


be the corresponding solutions of homogeneous linear system (1 1.69) on a < t < b. By 
definition, the Wronskian of the n solutions f x , f 2 of ( 1 1 .59) is 

/l $2 fn 

f\ f f 2 ••• /: 


l/r l) /r u ••• /r l, l 

Now note that by definition of the Wronskian of n solutions of (1 1.69) (Section 1 1.2), 
this is also the Wronskian of the n solutions <|>i, <|> 2 ,..., <!>„ of (11.69). That is, 

m ) =W(i )>! , <t> 2 , . . . , <t>„)(0 


(11.82) 
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for all t e [a, ft]. By Theorem 11.5, either W(ty l9 <t> 2 , . . . , <(>„)(t) = 0 for all t e la, ft] or 
= 0 for no te[a, ft]. Thus either W(f l ,f 2 ,...,f n )(t) = 0 for all 
t e la, ft] or W(f l ,f 2 ,...,f n )(t) = 0 for no tela, ft]. Thus we have obtained the 
following result. 


THEOREM 11.20 

Let /i , / 2 ft^ n solutions of the nth-order homogeneous linear differential equation 

(11.59) on a < t < ft. Then either W(f x , / 2 ,..., f n )(t) = 0 for all f 6 [a, ft] or 
^(/i,/ 2>--->/«)(0 = 0 for no tela, ft]. 

By Theorem 1 1 . 1 9 the solutions f i , f 2 of ( 1 1 .59) are linearly independent on 

la, ft] if and only if the corresponding solutions 


II 

ft ) 

, <t>2 = 

ft ^ 

II 

s 

If. \ 

fn 

1 

i fr l) j 



I 1 

\/r l 7 


of (1 1.69) are linearly independent on [a, ft]. By Theorem 1 1.6, the solutions <J>i, <t > 2 9 , 

<(>„ of (11.69) are linearly independent on la, ft] if and only if W(^ l , c(> 2 , , <t>„)(0 # 0 

for all t e la, ft]. Thus, using (1 1.82), <J> l5 <|> 2 , . . . , <|> w are linearly independent on [a, ft] if 
and only if W(f l9 / 2 ,..., f„)(t) ^ 0 for all t e la, ft]. Combining this with the first 
sentence of this paragraph, we see that f i , f 2 , ... , f n are linearly independent on [a, ft] if 
and only if W(f x , f 2 , . . . , f„)(t) ^ 0 for all t e la, ft]. Thus we have the following results. 


THEOREM 11.21 

Let /i, f 2 , ...,/„ be n solutions of the homogeneous linear differential equation 

d n y d n dx 

a o(t)-^ + ai(t)-jp^T + ■■■ + + a n (t)x = 0 (11.59) 

on la, ft]. These n solutions /i , / 2 , . . . , of (11.59) are linearly independent on [a, ft] if 

and only if 

for all t e [a, ft]. 

Using this result and that of Theorem 11.20, we at once obtain the following 
corollary. 


COROLLARY TO THEOREM 11.21 


Let f Y , f 2 ,---, f n be n solutions of the homogeneous linear differential equation (11.59) 
on la, ft]. These n solutions are linearly dependent on la, ft] if and only if 
W(fi,f 2 ,. . . , /„)(!) = 0 for all t e [a, b]. 
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► Example 11.15 

Consider the third-order homogeneous linear differential equation 

, d 3 x A i d 2 x dx ^ /4 4 

t 3 — r — 4 1 2 — y + 8f — 8x = 0. (1 1.83) 

dt 3 dt 2 dt 

The overall hypothesis of this section that the coefficients be continuous and the 
leading coefficient be unequal to zero holds on every interval a < t < b not including 
zero. One can easily verify that the functions f x , f 2 , / 3 defined respectively by f x (t) = t , 
f 2 (t) = t 2 , f 3 (t) = t 4 are all solutions of (11.83) on any such interval. Evaluating the 
Wronskian of these solutions, we find 

t 4 t 2 t 

W(fi, fi , / 3 )(0 - ^ ^ 4 ) = 4t 3 2t 1 = -6t 4 # 0. 

12t 2 2 0 

Thus by Theorem 1 1.21, the three solutions f u / 2 , / 3 are linearly independent on every 
interval a < t < b, not including zero. 


DEFINITION 

A set of n linearly independent solutions of the nth-order homogeneous linear differential 
equation (11.59) is called a fundamental set of (11.59). 


THEOREM 11.22 


There exist fundamental sets of solutions of the nth-order homogeneous linear differential 
equation 


a °^ If + + ‘ " + a "- + a "^ x = 0 


(11.59) 


on a < t < b. 


Proof. By Theorem 11.7 there exists fundamental sets of the corresponding 
homogeneous linear system (1 1.69) on [a, b]. Let 



be such a set. By definition of fundamental set of (1 1 .69), these solutions <|>i , <t> 2 , • • • , <|>„ 
of (11.69) are linearly independent on [a, b]. Then by the converse part of 
Theorem 11.19, the corresponding solutions /i , / 2 , • • • , /„ of (11.59) are linearly 
independent on [a, b]. Finally, by definition of fundamental set of (11.59) these 
solutions /i,/ 2 ,...,/„ constitute a fundamental set of (11.59) on [a, b]. Q.E.D. 
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We now exhibit a fundamental set of (1 1.59). Consider the solutions 



of corresponding homogeneous linear system (1 1.69) defined respectively by 



that is, in general, by 


f?(t o) 


jO, j ¥=i - 1, 

jl, j = i — 1, (i= 1,2,. = 0, 1). 


(11.84) 


By the Corollary to Theorem 1 1 . 1 7, f t , f 2 ,...,/„ is a set of solutions of ( 1 1 .59) which 
satisfy the same initial condition (1 1.84). That is, f lt f 2 , ■ ■ ■ , /„ are n solutions of (1 1.59) 
such that 


= 1 flit 0 ) = 0 

/'i(Io) = 0 f' 2 (t 0 ) = 1 

fl(to) = 0 /Wo) = 0 

/T 2, (t o) = 0 f ( r 2 ) (t 0 ) = 0 
f { r l \t 0 ) = o fr l %) = o 


u t 0 ) = o 
/;(^o) = o 
= 0 

(11.84) 

f { r 2 ) (to) = o 
/r l %) = i- 


Thus the Wronskian f„)(t 0 ) = 1, that is, is not zero; and so by 

Theorem 11.20, W(/i, / w )(t) # 0 for all t e [a, fe]. Thus by Theorem 11.21, the 

solutions /i,/ 2 , . . . , /„ of (1 1.59) so defined form a fundamental set of (1 1.59) on [a, ft]. 


We thus see that any n solutions /i,/ 2 ,...,/„ of Equation (11.59) which satisfy 
conditions of the form (1 1.84) at some point t 0 of a < t < b are linearly independent on 
a < t < ft. However, there are also sets of n linearly independent solutions of 
Equation (1 1.59) which do not satisfy conditions of the form (1 1.84) at any one point t 0 
of a < t < ft. For example, the functions and f 2 defined respectively by f^t) = sin t , 
f 2 (t) = sin t + cos t are a set of linearly independent solutions of 


on every closed interval a < t < ft, but there is no point t 0 such that /i(t 0 ) = 1, 

f\(to) = 0,f 2 (t o ) = 0J' 2 (t o ) = L 
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THEOREM 11.23 

Let /i, / 2 , •••>/*» be a fundamental set of solutions of the nth-order homogeneous linear 
differential equation 


,d n x ,d n ~ l 

aM lF + aM 7r 


x , v dx , v 

T + ••• + a n-i(t) — + a„(t)x = 0, 


dt 


(11.59) 


and let f be an arbitrary solution of (11.59) on [a, b~\. Then f can be represented as a 
suitable linear combination of f 1 ,f 2 ,...,f n .That is, there exist numbers c l ,c 2 ,.-.,c„ such 
that 

f= C lfl + C 2fl + "* + C nfn (11.85) 

on [ a , b~\. 

Proof. Let 



If 


t fl \ 


f \ 


If. \ 

d> = 

f 

» ^1 = 

f'i 

* 

, <l>2 = 

1/2 


fn 




\f ( r l) ) 


i/r u j 


\ fr"i 


be the solutions of the related homogeneous linear system (11.69) corresponding 
respectively to the solutions f f i9 / 2 ,..., /„ of (11.59). Since f i9 / 2 ,..., f n are a 
fundamental set of (11.59) on [a, b], they are linearly independent on [a, b~\. Then by 
Theorem 11.19, <|>i,<|> 2 , •••,<!>« are linearly independent on [a, b~\; that is, they 
constitute a fundamental set of (11.69) on [ a , b~\. Thus by Theorem 11.8 there exist 
numbers c 1 ,c 2 ,...,c n such that 


4> — c l < l ) l + C 2 ( l ) 2 + *“ + C n$n- 


The first component of this vector relation is 


/ - C l/l + C 2fl + “ * + C nfrr 


(11.85) 


Thus there exist numbers c i9 c 2 ,... y c„ such that (11.85) holds. 


Q.E.D. 


► Example 11.16 

Every solution / of 


can be expressed as a linear combination of any two linearly independent solutions f x 
and / 2 . For example, if f(t) = 2 sin t + 4 cos t^ft) = sin t, and f 2 (t) = cos t,we have 
f(t) = 2 fft) + 4 f 2 (t); while if fft) = sin t and f 2 (t) = sin t + cos t 9 we have f(t) = 

- 2/ 1 (0 + 4 / 2 ( 0 . 
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Exercises 

1. Consider the third-order homogeneous linear differential equation 

d 3 x d 2 x dx 

7T-7T + T" x = a 
dt 3 dt 2 dt 

(a) Find the three linearly independent solutions f l9 / 2 , f 3 of this equation 
which are such that 

/i(0)=l, / 2 (0) = 0, / 3 (0) = 0, 

/'i(0) = 0, f 2 ( 0)=1, /a(0) = 0, 

/ 1 (0) = o, rm = o, / 3 (o) = i, 

respectively. 

(b) Express the solution 

2 sin t + 3 cos t + 5e l 

as a linear combination of the three linearly independent solutions f i9 / 2 , and 
/ 3 defined in part (a). 

2. (a) Show that e\ e\t - 1), and 2e l — e 2t are linearly independent solutions of 


, d 2 x „ dx 

■ 4 TT+ 5 "7- 
dt 2 dt 


lx = 0. 


(b) Express the solution 2e x + 3 te l -f 4e 2t as a linear combination of the three 
linearly independent solutions of part (a). 

(c) Find another set of three linearly independent solutions of the given equation. 

3. (a) Show that sin(r 3 ) and cos(t 3 ) is a fundamental set of the differential equation 

d 2 x ^ dx - 
t -p=- — 2— + 9t 5 x = 0 
dt 2 dt 

on every closed interval [ a , ft], where 0 < a < ft < oo. 

(b) Show that t , t 2 , and t 3 is a fundamental set of the differential equation 


, d 3 x ^ ^ d 2 x , dx 
1 3 - rT -3r -rr + ti-r 
dt 3 dt 2 dt 


6x = 0 


on every closed interval [a, ft], where 0 < a < ft < oo. 

4. Consider the second-order homogeneous linear differential equation 

, , d 2 x , x dx 

+ a i(0 -^r + = 0, (A) 

where a 0 , a 1? and a 2 are continuous on an interval a < t < ft, and a 0 (t) # 0 on this 
interval. Let / and g be distinct solutions of differential equation (A) on a < t < ft, 
and suppose g(t) # 0 for all t e [a, ft]. 

(a) Show that 

d r/(0l Ln 

— — - = r , ,_, 2 — for all t g [a, ft]. 


for all t g [ a , ft]. 
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(b) Use the result of part (a) to show that if g(t)~\ = 0 for all t e [a, b], 

then the solutions / and g are linearly dependent on this interval. 

(c) Suppose the solutions / and g are linearly independent on a < t < b, and let h 
be the function defined by h(t) = f(t)/g{t ), a < t < b. Show that h is a 
monotonic function on a < t < b. 

5 . Let / and g be two solutions of the second-order homogeneous linear differential 
equation (A) of Exercise 4 . 

(a) Show that if / and g have a common zero at t 0 e [a, b], then / and g are 
linearly dependent on a < t < b. 

(b) Show that if / and g have relative maxima at a common point t 0 e [a, b], then 
/ and g are linearly dependent on a < t < b. 

6 . Consider the second-order homogeneous linear differential equation (A) of 
Exercise 4 . Let / and g be two solutions of this equation. Show that if / and g are 
linearly independent on a < t < b and A l9 A 2 , B i9 and B 2 are constants such that 
A i B 2 — A 2 B l # 0 , then the solutions A i f + A 2 g and B i f + B 2 g of Equation (A) 
are also linearly independent on a < t < b. 

7 . Let / and g be two solutions of the second-order homogeneous linear differential 
equation (A) of Exercise 4 . Show that if / and g are linearly independent on 
a < t< b and are such that f"(t 0 ) = g"(t 0 ) = 0 at some point t 0 e [a, b\ then 
0i(*o) = fl2^o) = 0. 


8. (a) Show that the substitution 

x = y exp 




transforms the second-order homogeneous linear differential equation 


d 2 x 

~dt 


dx 

~dt 


2 + P i(0 37 + P 2 (t)x — 0 


(A) 


into the second-order homogeneous linear differential equation 

+ |p 2 (0 - \ P\(t) - \ IPi W] 2 }y = o (B) 

in which the first-derivative term is missing. 

(b) If g^t) and g 2 (t) is a fundamental set of Equation (B) on an interval 
a < t < b, show that 


and 


flit) = g t {t) exp 


1 

2 


p t (0 dt 


flit ) = g 2 (t) exp 



P t (t) dt 


is a fundamental set of Equation (A) on this interval. 
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9. Use the substitution given in Exercise 8(a) to transform the differential equation 

d ^ + (2 t +l)^ + (t 2 + t + ^ x = 0 (C) 

into an equation (D) of the form (B) (of Exercise 8) in which the first-derivative 
term is missing. Obtain a fundamental set of the equation (D) which thus results, 
and use this fundamental set to obtain a fundamental set of the given equation (C). 


11.6 FURTHER PROPERTIES OF THE nth-ORDER HOMOGENEOUS 
LINEAR DIFFERENTIAL EQUATION 

A. More Wronskian Theory 


We first use Theorem 11.12, the Abel-Liouville formula for homogeneous linear 
systems, to obtain the corresponding result for the single nth-order homogeneous 
linear scalar differential equation 

d n x d n ~^x dx 

a o(t ) -jpr + «|(0 + • • • + «»-iW Yt + a " (t)x = a (1 L59) 


THEOREM 11.24 Abel-Liouville Formula 


Let /i, / 2 , . • . , /„ be n solutions of the homogeneous linear differential equation (11.59) on 
a < t < b; and let t 0 g [a, b]. Let W denote the Wronskian of /i , / 2 . Then 


W(t) = W(t 0 ) exp 


— 1 

1 

r ds] 

L J 

to a o( s ) J 


(11.86) 


for all t g [a, b]. 


Proof. Since / 1? / 2 ,. are solutions of (11.59), by Theorem 11.17, 



/ fi \ 


/ f 2 \ 


Ifn \ 


!/i 


fl ’ 


f'n 

<l>i = 


» <t>2 = 

fl 

,■■■, <i> n = 

f" 

J n 


• j 

\/r 7 




U"‘7 


are n solutions of the homogeneous linear system (1 1.69) on [a, b ]. Let W* denote the 
Wronskian of <t>i, <| ) 2 , . . . , <|> n . Then by Theorem 1 1.12, the Abel-Liouville formula for 
systems, 


W*(t) = W*(t 0 )GX p 



tr A(s) ds 


(11.87) 


where A is the coefficient matrix of system (1 1.69), holds for all t g [a, b~\. By formula 
(1 1.82), W*(t) = W(t) for all t g [ a , b]. Also, the coefficient matrix A of system (1 1.69) is 
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given by (1 1.71); and from this we see that tr A(s) in (1 1.87) is simply — a 1 (s)/a 0 (s). Thus 
(1 1.87) reduces at once to 


W(t) = W(t 0 ) exp 


r-r*?4 

L JfoMs) J 


Q.E.D. 


► Example 11.17 

The coefficients t 2 -f 1 and 2 1 in the differential equation 


drx 


dx 


(f2 + 1) ^ +2 ^ = ° 


(11.88) 


are continuous for all r, — oo < t < oo, and t 2 + 1 # 0 for all real t. The reader can 
verify that j\ and f 2 defined, respectively, by 

fi(t) = 1 and f 2 {t) = arctan t 

are solutions of Equation (11.88) on — oo < t < oo. Since 




1 arctan t 
0 1 


1 + t 2 


1 


1 + V 


*0 


on every finite closed interval a < t < b, we conclude that f x and f 2 are a set of linearly 
independent solutions of Equation (1 1.88) on every such interval. In this case the Abel- 
Liouville formula 


W(t) = W(t 0 ) exp 


Jto a o(s) J 


becomes 


for all t e [a, b~\. 


iT7 T “iT7I exp [ - £s 3 TT‘ ,s ] 


In order to obtain the next theorem, we shall need the following result from linear 
algebra, which we state without proof. 


THEOREM A 

Suppose m> n. Then a system of n homogeneous linear algebraic equations in m 
unknowns always has nontrivial solutions. 


THEOREM 11.25 

Suppose m > n\ and let fuf 2 ,---if m b e asetof m solutions of the nth-order homogeneous 
linear differential equation (11.59) on a < t < b. Then f u f 2 , ...,f m are linearly 
dependent on a < t < b. 
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Proof. By hypothesis, /i,/ 2 ,...,/ m are m solutions of the nth-order equation 
(11.59) on a < t < b, where m > n. Let t 0 be a point of the interval a < t < b and 
consider the following system of n homogeneous linear algebraic equations in the m 
unknowns k l ,k 2 ,...,k m . 

*i/i(*o) + k 2 f 2 (to) + *** + k m f m (t 0 ) = 0, 

kif\(t 0 ) + k 2 f' 2 (t 0 ) + *'* + k m f f m (t 0 ) = 0 , 

(11.89) 

kj { r l) (t 0 ) + k 2 fr l \t 0 ) + ••• + k m f { n~ l) (t 0 ) = 0 . 

By Theorem A this system has a solution 

ki = c t , k 2 = c 2 ,..., k m = c m , 

where c i9 c 2 , . . . , c m are not all zero . 

Now consider the function / defined by 

m = Cj x (t) + c 2 f 2 (t) + • • • + (1 1.90) 

for alU on a < t < b. The function / is a linear combination of the solutions f i9 f 2 , . . . , 
f m . Hence by Theorem 1 1.18 the function / is also a solution of Equation (1 1.59) on 
a < t < b. 

Now differentiate (11.90) (n — 1) times and let t = t 0 in (11.90) and the ( n — 1) 
derivatives so obtained. We have 

f(t 0 ) = C lfl( t o) + C 2/2(*o) + * " + C mfm(to)’ 
f'(t 0 ) = cj\(t 0 ) + c 2 f' 2 (t 0 ) + • • ■ + cj m {t 0 ) 9 


f in - l) (t 0 ) = cj ( r l) (t 0 ) + c 2 fr l \t 0 ) + ••• + c./r^o). 

The right members of these expressions are precisely the left members of Equations 
(11.89) with the unknowns k t replaced by c t (i = 1, 2,..., m). Since c i9 c 2 , . . . , c m is a 
solution of the system (1 1.89), we see at once that the solution / of Equation (1 1.59) is 
such that 

f(to) = f'(t o ) = -- = f in - 1) (t o ) = 0. 

Thus by the Corollary to Theorem 1 1.16 we conclude that f(t) = 0 for all t such that 
a<t<b. Thus from (1 1.90) we have 

c i/i(0 + ^2/2(0 + * ’ * + 

a <t <b 9 where c l ,c 2 ,...,c m are not all zero. Thus the solutions are 

linearly dependent on a < t < b. Q.E.D. 

The Corollary to Theorem 11.21 states a necessary and sufficient condition that n 
solutions of the nth-order homogeneous linear differential equation 

^ + fli(f) ~ + a„(t)x = 0 


(11.59) 
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be linearly dependent on a < t < b. This condition is that the value of the Wronskian 
of the n solutions be zero for all t on a < t < b. We now inquire whether or not this 
condition remains both necessary and sufficient if the n functions involved are not 
necessarily solutions of an nth-order homogeneous linear differential equation but are 
merely functions having a continuous (n — l)st derivative on a < t < b. The following 
theorem tells us that the condition is still necessary. 


THEOREM 11.26 


Hypothesis 

7. Let each of the functions /i ,/2 have a continuous (n — 1 )st derivative on 

a < t < b. 

2. Let the functions f l9 /*, • . . , f„ be linearly dependent on a < t < b. 

Conclusion. W(f t , / 2 , . . . , f n )(t) = 0 for all t on a < t < b. 

Proof. Since / lv / 2 , . . . , /„ are linearly dependent on a < t < b, there exist constants 
c l ,c 2 ,-.-,c„, not all zero , such that 

c i/i(0 + c 2 f 2 (t) + •*• + c n f n (t) = 0 (11.91) 

for all t on a < t < b. From (1 1.91) we obtain 

Cl AW + C 2 f' 2 (t) + ••• + cj'„{t) = 0, 


(11.92) 

c,A" _i, w + + ■■■ + c jr i \t ) = o 

for all t e [a, b~\. Now let t = t 0 be an arbitrary point of a < t < b. Then (1 1.91) and 
(1 1.92) hold at t = t 0 . That is, the relations 


cJiih) + c 2 f 2 (t 0 ) + ■■■ + c„f„(t 0 ) = 0 
cj'iih) + c 2 f' 2 (t 0 ) + ••• + c n f' n (t 0 ) = 0 


cjr i \to) + c 2 fr l) (t 0 ) + ■■■ + c H fr l) (t 0 ) = o. 

hold, where c t ,c 2 ,...,c„ are not all zero. Thus by Chapter 7, Section 7.5, Theorem A, we 
have 


Mto) flit 0 ) ••• fn(t o) 

fiUo) fiito) ••• /'.(to) 

f ( r l) Uo) f ( r l) (t 0 ) ... fr l) u 0 ) 

But this determinant is the value of the Wronskian of f l , f 2 , . 
arbitrary point of [a, b]. Hence 


= 0 . 


. . , /„ at t = t 0 , and t 0 is an 




for all t e [a, h]. 


Q.E.D. 



562 THE THEORY OF LINEAR DIFFERENTIAL EQUATIONS 


We now consider whether or not the condition W(f x ,f 2 ,...,f n ){t) = 0 for all t on 
a < t < b is still sufficient for linear dependence in the present case. The following 
example shows that it is not. 


► Example 11.18 


Consider the functions f l9 f 2 , and / 3 defined, respectively, by 
fi(t) = t\ f 2 (t) = 1, / 3 (r) = m 3 . 

Each of these functions has a continuous second derivative on every interval a < t < b. 
Further, 


W(fi, fi, /sXO = 


t 3 1 |r| 3 

3 t 2 0 3t\t\ 

6 1 0 6\t\ 


= 0 , 


a < t < b. 


But the functions / 1? / 2 , and / 3 are not linearly dependent on any interval a < t <b 
which includes t = 0 (see Exercise 3 at the end of this section). 


Thus we see that the condition W{f x , / 2 , . . . , f n )(t) = 0 for all t on a < t < b is not 
sufficient for linear dependence in general. However, we shall obtain the following 
result. 


THEOREM 11.27 
Hypothesis 

1. Let each of the functions f i9 / 2 ,...,/„ have a continuous (n — 1 )st derivative on 
a < t < b. 

2. Suppose W(f t , f 2 , . . . , f n )(t) = 0 for all t on a < t < b. 

3. Suppose the Wronskian of some (n — 1) of the functions /i,/ 2 , • • • ,/„ is unequal 
to zero for all t on a < t < b. That is, without loss in generality, suppose that 
W(f i9 f 2 ,...,f H -i)(t) #0 for alltona<t< b. 

Conclusion. Then the functions /i,/ 2 ,...,/„ are linearly dependent on a < t < b. 


Proof. The equation 


TO/iW A-iW x]=0; 


that is, 


Mt) f 2 (t) ••• X 

f\(t) flit) ••• fn-ltt) X' 


rr l) (t) ... /f.v’w * 


<«-!> 


(11.93) 


is a homogeneous linear differential equation of order (n — 1). By Hypothesis 1 the 
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coefficients are continuous on a < t < b. The coefficient of — - - 1 is W(f l9 / 2 ,..., 


f„~i)(t), and by Hypothesis 3 this is never zero on a < t < b. If any one of 
/ ft), f 2 (t ), . . . , /„_ ft) is substituted for x in Equation (1 1.93), the resulting determinant 
will have two identical columns. Thus each of the functions f t , f 2 x is a solution 

of the differential equation (1 1.93) on a < t < b. By Hypothesis 2 the function /„ is also 
a solution of Equation (11.93) on a < t < b. Thus on a < t < b the n functions 
are solutions of the differential equation (11.93). Since this 
differential equation is of order ( n - 1), we know from Theorem 1 1.25 that no set of n 
solutions of it can be linearly independent. Thus the functions / 1? / 2 , ...,/„ a re linearly 
dependent on a < t < b. Q.E.D. 


► Example 11.19 


Although the functions f i9 f 2 , of Example 1 1.18 are not linearly dependent on any 
interval a < t < b, including t = 0, it is clear that they are linearly dependent on every 
interval a < t < b which does not include t = 0. Observe that on any such interval not 
including t = 0 the hypotheses, of Theorem 1 1.27 are indeed satisfied. In particular, 
note that Hypothesis 3 holds, since 




t 3 
3 1 2 


1 

0 


-3f 2 #0 for t # 0. 


Theorem 11.22 showed that every /ith-order homogeneous linear differential 
equation (11.59) possesses sets of n linearly independent solutions on a < t < b, and 
Theorem 1 1.21 showed that the value of the Wronskian of such a set is unequal to zero 
on a < t < b. Now, given a set of n functions having continuous nth derivatives and a 
nonzero Wronskian on a < t < b, does there exist an nth-order homogeneous linear 
differential equation for which this set of n functions constitutes a fundamental set (set 
of n linearly independent solutions) on a < t < bl In answer to this we shall obtain the 
following theorem. 


THEOREM 11.28 
Hypothesis 

1. Let /i ,/ 2 ,. be a set of n functions each of which has a continuous nth 
derivative on a < t < b. 

2. Suppose W{f i , / 2 , . . . , f n )(t) / 0 for all t on a < t < b. 


Conclusion. There exists a unique normalized (coefficient of is unity) homo- 
geneous linear differential equation of order n (with continuous coefficients) which has 
fiifiT-'ifn as a fundamental set on a < t < b. This equation is 

WU> 


(11.94) 
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Proof. The differential equation (1 1.94) is actually 

flit) f 2 (t) ... f„(t) X 

/’ft) m ... m x' 


Lift) ff{t) ... /<">« x<”> 


flit) 

fiit) . 

• • m 

flit) 

fiit) ■ 

•• /;(f) 

T l> it) 

fr i] it) • 

•• fn~ l \t) 


This is of the form 


d n x ,d n ~ 

1F + Pl ^~dF 


x , x dx , , 

r + • • • + p n . i(t) — + p n (t)x = 0, 


and so is a normalized homogeneous linear differential equation of order n. Also, by 
Hypotheses 1 and 2 the coefficients p t (i = 1, 2, . . . , n) are continuous on a < t < b. If any 
one of is substituted for x in Equation (11.95), the resulting 

determinant in the numerator will have two identical columns. Thus each of the 
functions f i9 / 2 , is a solution of Equation (1 1.94) on a < t < b; and by Theorem 
11.21 we see from Hypothesis 2 that these solutions are linearly independent on 
a <t < b. Thus Equation (11.94) is indeed an equation of the required type having 
as a fundamental set. 

We must now show that Equation (11.94) is the only normalized nth-order 
homogeneous linear differential equation with continuous coefficients which has this 
property. Suppose there are two such equations 


d n x d n l x dx 

+ ^ l(t) + " ‘ ' ' + q - l(0 Tt + q " (t)x ~ °’ 

d n x d n ~ * y dx 

IF + ri(t) + "' + r "- l{t) lf + r " {t)x ' = a 


(11.96) 


Then the equation 

c«?i(f) - MO] + • • • + [MO - r„(0]x = 0 (1 1.97) 

is a homogeneous linear differential equation of order at most (n — 1), and the 
coefficients in Equation (11.97) are continuous on a < t < b. Further, since /i,/ 2 , •••,/„ 
satisfy both of Equations (1 1.96) these n functions are all solutions of Equation (1 1.97) 
on a < t < b. 

We shall show that q t (t) - r x (t) = Oforalhona < t < b.Todoso,leU 0 be a point of 
the interval a<t<b and suppose that qi{t 0 ) - r^to) ^ 0. Then there exists a 
subinterval /, a <; t < /?, of a < t < b containing t 0 such that q t (t) - r^t) # 0 on /. 
Since the n solutions f i9 f 2 , •••,/« of Equation (11.97) are linearly independent on 
a < t < b, they are also linearly independent on /. Thus on /, Equation (1 1.97) of order 
at most (n — 1), has a set of n linearly independent solutions. But this contradicts 
Theorem 1 1.25. Thus there exists no t 0 e [a, b~\ such that q^to) — r^to) # 0. In other 
words, q x (t) — r^t) = 0 for all r on a < t < b . 
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In like manner, one shows that q k (t) — r k (t) = 0, k = 2, 3, . . . , n, for all t on a < t < b. 
Thus Equations (1 1.96) are identical on a < t < b and the uniqueness is proved. 

Q.E.D. 


► Example 1 1 .20 


Consider the functions f i and f 2 defined, respectively, by f t (t) = t and f 2 (t) = te x . We 
note that 




t te x 
1 te x + e l 


t 2 e x #0 for t # 0. 


Thus by Theorem 11.28 on every closed interval a < t < b not including t = 0 there 
exists a unique normalized second-order homogeneous linear differential equation 
with continuous coefficients which has f i and f 2 as a fundamental set. Indeed, Theorem 
1 1.28 states that this equation is 


W[t, te x , x] 


= 0 . 


W[t , te x ] 

Writing out the two Wronskians involved, this becomes 


t te x x 

1 te x + e x x ' 
0 te x + 2e x x" 


= 0 . 


t te x 
1 te x + e x 


Since 


t 

t 

X 

o d 2 x dx 

t t 

1 

t + 1 

x' 

= t 2 —~y — t(t + 2) — + (t + 2)x and 
dt dt 

i t + 1 

0 

t + 2 

x" 



we see that this differential equation is 

d 2 x ft + 2\ dx 
dt 2 l t ) dt 


+ 


t + 2\ 


Jx = 0. 


= t 2 , 


B. Reduction of Order 

In Chapter 4 we introduced the process of reduction of order of an nth-order 
homogeneous linear differential equation. There we stated the basic theorem without 
proof and went on to consider the second-order case in greater detail. We employed the 
process in Chapter 4 and later in Chapter 6. We shall now restate the basic theorem, 
give its proof, and indicate further how it may be employed. 
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THEOREM 11.29 

Hypothesis. Let f be a nontrivial solution of the nth-order homogeneous linear 
differential equation 


d n x d n dx 

a o(t) + aft) + • • • + ft) — + a n (t)x = 0. (1 1.59) 

Conclusion. The transformation 

x = f(t)v (11.98) 

reduces Equation (11.59) to an (n — 1 )st-order homogeneous linear differential equation in 
the dependent variable w, where w = dv/dt. 


Proof. Let x = f (t)v. Then 


dx dv 


dt 2 


dt 


d n x d n v „ . 


d n l v n(n - 1) . d" ~v 


2 , 


2! 




Substituting these expressions into the differential equation (11.59), we have 


4 n ~ It 


a o(t) I ff) + nf'(t) ’ • • + f in) (t)v 


dt 
+ aft) 


d n *v d n 2 v 

m + (" - !)/'(») ^rrr + • ■ " ■ + P- l) (t)v 


+ *“ + a n-ft) 


™ d i +f 


'(t)v + a n i 


M/M” = 0 


or 


d n v 


a o(t)f(t)-^ + [na o (t)/'(0 + tfi(0/(0] + “■ 


+ [«fl 0 W/ ( " _1, W + - + «.- 1(0/(0] 


dt? 

di 


+ [«o(0/ ,n, (f) + a i( t )/ < " _1> ( t ) + ••• + a„_ 1 (r)/'(f)‘+ a„(f)/(f)]t7 = 0. (11.99) 

Now since / is a solution of Equation (1 1.59), the coefficient of v is zero. Then, letting 
w = dv/dt, Equation (1 1.99) reduces to the (n - l)st-order equation in w, 

d"~ l w d"~ 2 w 

A 0 (t) ^ f «-2 + ••• + A„- l (t)\V = 0, 


(11.100) 
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where 

A 0 (t) = a 0 (t)f(t), 

A^t) = na 0 (t)f'(t) + 

4,-iW = na 0 (t)f in ~ l \t) + ••• + «„-l(0/W- 

Q.E.D. 


Now suppose that w l5 w 2 , . . . , w„_ t is a known fundamental set of equation (1 1.100). 
Then v l9 v 2 defined by 




w t (t)dt, 


v 2 (t) = 


I 


w 2 (r) dt,.. 


v„-i (t) = 


w„-i (0 


is a set of (n — 1) solutions of Equation (1 1.99). Also, the function t;„ such that v„(t) = 1 
for all t is a solution of Equation (11.99). These n solutions v i9 t; 2 ,..., t;„ of Equation 
(11.99) are linearly independent. Then, using (11.98) we obtain n solutions f h where 
fi(t) = = l,2,...,n)of the original nth-order equation (11.59). Then solutions 

fi so defined are also linearly independent and thus constitute a fundamental set of 
Equation (11.59). 

One may extend Theorem 1 1.29 to show that if m (where m < n) linearly independent 
solutions of Equation (11.59) are known, then Equation (11.59) may be reduced to a 
homogeneous linear equation of order (n - m). 


Exercises 


1 . 


In each case, verify the truth of the Abel-Liouville formula (Theorem 1 1.24) for the 
given fundamental set of the given differential equation. 

(a) Fundamental set: sin (t 3 ), cos (t 3 ). 

Differential equation: 


2 x ^ dx 


+ 9 t 5 x = 0. 


(b) 


Fundamental set: t, t 2 , t 3 . 
Differential equation: 


d 3 x 


d 2 x 


t 3 — r-- 3 1 2 — r + 6f 


dt 


dt 2 


dx 

It 


6x = 0. 


2 . 


3 . 


Let / and g be defined, respectively, by 
t 2 , t > 0, 


f(t) = 


t < 0 


and g(t) 


= |0, t > 0, 
{r 2 , t < 0. 


Show that W(f g)(t) = 0 for all t but that / and g are not linearly dependent on any 
interval a < t < b which includes t = 0. 

Consider the functions considered in Example 11.18, namely, / , , f 2 , and / 3 defined 
respectively by 


(a) 


A(t) = t\ f 2 (t)= l, Mt) = \t | 3 . 

Show that each of these functions has a continuous second derivative on every 
interval a < t < b. 
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(b) Show that these three functions are not linearly dependent on any interval 
which includes t = 0 . 

4. In each case find the unique normalized homogeneous linear differential equation 
of order 3 (with continuous coefficients) which has the given set as a fundamental 
set on some interval a < t <b. 

(a) t, t 2 , e‘. 

(b) t , sin t , cos t. 

(c) t, t 3 , t 5 . 

(d) e\ t 2 , t 2 e'. 

5. The differential equation 

(r 3 - It 2 ) + 2 1 2 - 6 1 ) ^ + (3r 2 - 6)x = 0 


has a solution of the form t n , where n is an integer. 

(a) Find this solution of the form t n . 

(b) Using the solution found in part (a) reduce the order and find the general 
solution of the given differential equation. 

6 . Given that the differential equation 

. d 3 x , ^ 9 d 2 x _ , . dx 

t — (t + 3 )t + 2 t(t + 3) — — 2(t + 3)x — 0 

has two linearly independent solutions of the form t n , where n is an integer, find the 
general solution. 

7. Let {/, g] and {F, G } be two fundamental sets of the differential equation 

d 2 x dx 

flow -faT + «i(0 y + «2 (t)x = o 

on a < t < b. Use the Abel-Liouville formula to show there exists a constant c # 0 
such that W{f g)(t) = cW{F , G)(t) for all t e [a, b]. 

8 . Consider the normalized nth-order homogeneous linear differential equation 

d n x n , d n ~ l x „ , x dx n x 

+ P i(0 + ”* + + ^»(0 X = 0, (A) 


where /^(i = 1, 2,..., n) are continuous on a < t < b. Let /i,/ 2 , . . . be a fun- 
damental set of this equation on a < t < b. By the uniqueness result of Theo- 
rem 1 1.28 the differential equation (A) may be written 


and the coefficients of 


d n ~ l x 

dt n ~ l 


in the two expressions (A) and (B) for the differential 


equation must be identical. Equate these coefficients to show that 


W(f l,/2, 
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satisfies the first-order differential equation 

dW 

— + Pft)W = 0. 

Note that this paves the way for the proof of the Abel-Liouville formula 
(Theorem 11.24) in the case of the normalized nth-order equation. 

11.7 THE nth-ORDER NONHOMOGENEOUS LINEAR EQUATION 

In this section we consider briefly the nth-order nonhomogeneous linear scalar 
differential equation 

d n x d n ~^x dx 

a °(t) ppr + ai(0 pprr + ■■■ + 1(0 pp + a„(t)x = F(t). (11.101) 

Using the operator notation already introduced in Section 1 1.5 of this chapter, we may 
write this as 


L n x = F(t ), (11.102) 

where, as before, 

d n d n ~ l d 

1 n = «o(f)^ + a i( t )pprr + ••' + 

We now prove the basic theorem dealing with Equation (11.102). 

THEOREM 11.30 
Hypothesis 

1. Let v be any solution of the nonhomogeneous equation (1 1 .102). 

2. Let u be any solution of the corresponding homogeneous equation. 

L„x = 0. (11.103) 

Conclusion. Then u -f vis also a solution of the nonhomogeneous equation (11.102). 

Proof. We have 

L„[u(t) + u(t)] 

d n d"~ l 

= «o(0 pp [«(0 + f(0] + a i(t) pp-j; [»(0 + t>(t)] 

+ ••• + ««-i (t)j t [u(t) + y(f)] + a„(t)lu(t) + t>(f)] 

d n d n ~ l d 

= ao(t) pp u(t) + «i(t)^TT u (0 + ••• + + a n (t)u(t) 

d" d n ~ l d 

+ «o(0 pp v(t) + «i(0 pp^r ®(0 + ' ' • + a n -i(t) j t y(f) + a n (t)v(t) 


= L„[u(f)] + L„[v(t)]. 
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Now by Hypothesis 1, L„[t;(r)] = F(t ); and by Hypothesis 2, L„[u(t)] = 0. Thus 
L n [u(t) + = F(t); that is, u + v is a solution of Equation (1 1.102). Q.E.D. 

In particular, if /i,/ 2 ,...,/„ is a fundamental set of the homogeneous equation 
(1 1. 103) and v is any particular solution of the nonhomogeneous equation (1 1.102) then 

C lfl + C 2fl + * * * + C nfn + V 

is also a solution of the nonhomogeneous equation (11.102). If a fundamental set 
/i,/ 2 ,. ••>/* °f the corresponding homogeneous equation (11.103) is known, then a 
particular solution of the nonhomogeneous equation (1 1.102) can always be found by 
the method of variation of parameters. This method was introduced in Chapter 4, 
Section 4.4. In that section we gave a detailed development of the method for the case 
of the second-order equation and considered several illustrative examples. We now 
state and prove the relevant theorem for the general nth-order equation. 


THEOREM 11.31 


Hypothesis 

1. Let /i , / 2 , be a fundamental set of the homogeneous equation 

L n x = 0 (11.103) 

on the interval a < t < b. 

2. Let v be a function defined by 

v(t) = Mt)/i(0 + v 2 (t)f 2 (t) + - + v n (t)fM (11.104) 

a < t < b, where v l ,v 2 ,---,v„ are functions such that 

y i(0/i(0 + v' 2 (t)f 2 (t) + * * * + v'„(t)f n (t) = 0, 

v\(t)f\(t) + v 2 (t)f 2 (t) + ••• + 1 mm = o, 


: (11.105) 

v\(t)fr 2) u) + v 2 (t)fr 2 \t) + ••• + v n (t)fr 2 \t) = o, 
v\(t)fr l \t) + v 2 (t)/r i \t) + ••• + v„(t )/r i, (o = 

«o(0 

for a < t < b. 


Conclusion. Each such function v is a particular solution of the nonhomogeneous 
equation 

L n x = F(t) (11.102) 

on a < t < b. 


Proof. We first consider the linear algebraic system (11.105) in the n unknowns 
v\(t), v 2 (t), . . . , The determinant of coefficients of this system is 
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Since /i,/ 2 ,...,/ n is a fundamental set of the homogeneous equation (11.103) on 
a < t < b, this determinant is different from zero for all t e [a, h]. Therefore the system 
(11.105) uniquely determines v\(t), v 2 (t),..., a < t < b, and hence v x (t) 9 v 2 (t),..., 
v n (t) may be determined by antidifferentiation. 

In this manner functions v are indeed defined by (11.104). We must now show that 
any such function v satisfies the nonhomogeneous equation (11.102). 

Differentiating (1 1.104), we obtain 

*(0 = M0/i(0 + v 2 (t)f 2 (t) + - + V m (t)f m (t) 

+ mm + v' 2 (t)f 2 (t) + ••• + v' n ( t )fM 

and using the first equation of (1 1.105) this reduces to 

At) = MO/'i (0 + v 2 (t)f 2 (t) + ••• + v n (t)f’M (11.106) 

Differentiating (11.106), we obtain 

At) = v l (0/1(0 + M0/SM + ••• + ».(0/:(0 

+ ^i(0/i(0 + v 2 (t)f' 2 (t) + ••• + v' n (t)f' n (t); 
and using the second equation of (1 1.105) this reduces to 

v"(t) = M0/l(0 + v 2 (t)f 2 (t) + + v n (t)f'M (11.107) 

Proceeding in like manner, we also obtain 

= v, (t)m) + v 2 (t)/m + - + v n {t)f:\t\ 

= »i(o/r “(o + v 2 (t)pr l) (t) + ••• + r.(o/L" _: l \t), duos) 

V (n \t) = V Mfm + V 2 (t)fP(t) + ••• + v n (t)f?(t) + 

a 0 (t) 

We now substitute (11.104), (11.106), (11.107), and (11.108) into the nonhomo- 
geneous differential equation (1 1.102). We obtain 

+ V 2 (t)p 2 n) (t) + ■■■ + v„(t)p;\t) + 

+ «i(0[»i(0/? _1, (0 + »a(0/? _1, (0 + ••• + ».(0/f _1) (0] 

+ ••• + a„(0l>l(0/l(0 + V 2 (t)f 2 (t) + ••• + v„{t)f„(t)} = F(t) 
or 

®i(0[flo(0/S°(0 + «i(0/ ( " _1, (0 + ••• + «n(t)/i(0] 

+ MOCMO/iftO + «i(t)/? -1, (t) + ••• + a„(f)/ 2 (t)] 

+ ••• + MOKw/no + a 1 (t)/r ,, (o + ••• + «.(o/.(o] 

+ F(t) = F(t). (11.109) 

Since f t , / 2 , . . . , f„ are solutions of the homogeneous equation ( 1 1 . 1 03) on a < t < b, 
each of the expressions in brackets in (11. 109) is equal to zero for all t e [ a , b]. Therefore 
(1 1.109) reduces to F(t) = F(t) and thus the function v satisfies the nonhomogeneous 
differential equation (11.102) on a < t < b. Q.E.D. 
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We now prove the so-called superposition principle for particular solutions of 
Equation (11.102). 


THEOREM 11.32 
Hypothesis 

1. Let L n x = F t (t ), i = l,2,...,m, be m different nonhomogeneous linear equations , 
each of which has the same left member 

d n x d n ~ l x 

L„x = a 0 (0 -jpr + a i(t) + ' ' ' + a„(t)x. 

2. Let f be a particular solution of L n x = Fff), i = 1,2 , . . . , m. 

m 

Conclusion. Then £ kffi is a particular solution of the equation 


i = 1 


L n x = X kM), 

i= 1 


where k l9 fc 2 , . . . , k m are constants. 

Proof. We have 


m d n m m 

■n £ kJi(t) = a 0 (t) — £ kJAt) + • • • + a„(t) £ kj^t) 

_i = 1 J at l_i=i J [_; = 1 


= k 1 


d" 

aoW-jpMt) + ••' + 




+ 


+ K 


d" 


a° (t) ^ f m (t) + • • • + a„(t)fjt) 


] 


= fc 1 L„[/ 1 (0] +--- + k m L„lf m m 

By Hypothesis 2, f t satisfies L„x = f|(t), i = 1, 2, m. Thus ^[/^t)] = 

m “I m 

^ H [/m(0] = ^m(0. and so L„ £ + •• • + That is £ &;/; is a 

i = 1 J i=l 

Q.E.D. 


solution of L n x = £ 

i = 1 


Exercises 

1. 


Show that Theorem 11.30 does not in general apply to nonlinear equations by 
showing that although / such that f(t) = t is solution of 

d 2 x 0 dx ~ 

-rr-t 2 — + x ; ' = 0 
dt 2 dt 


and g such that g(t) = 1 is a solution of 


d 2 x 

1 ? 


- t 


2 dx 2 

7i + x - 1 - 
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the sum / + g is not a solution of 


d 2 x , dx 0 

-r T -t 2 — + X l = l. 

dt 2 dt 


2. Apply the superposition principle for particular solutions (Theorem 11.32) to find a 
particular solution of 


d 2 x ^ sin kt 


11.8 STURM THEORY 

A. Self-Adjoint Equations of the Second Order 

In Chapter 14 we shall present a basic method for obtaining a formal solution of a type 
of problem which involves a partial differential equation and various supplementary 
conditions. Many problems of this type actually arise in the application of differential 
equations to certain problems of mathematical physics and engineering. In solving 
such problems we encounter boundary-value problems involving second-order 
ordinary differential equations; many of these second-order ordinary differential 
equations are, or can be transformed into, so-called self-adjoint form. In this section we 
shall consider some basic properties of these second-order self-adjoint ordinary 
differential equations. We begin by introducing the adjoint of a second-order homo- 
geneous linear differential equation. 


DEFINITION 

Consider the second-order homogeneous linear differential equation 

d 2 x dx 

a o(t)-^r + a^t)— + a 2 (t)x = 0, (11.110) 

where a 0 has a continuous second derivative , a i has a continuous first derivative , a 2 is 
continuous , and a 0 (t) # 0 on a < t < b. The adjoint equation to Equation (11.110) is 

d 2 d 

^2 [«o(f)x] - J t l>i(0*] + a 2 {t)x = 0, 

that is, after taking the indicated derivatives, 
d 2 x dx 

a 0 (t) + [2a' 0 (f) - a t (f)] — + [a'd(f) - a\(t) + a 2 (f)]x = 0, (11.111) 


where the primes denote differentiation with respect to t. 

Note. We assume the additional hypothesis on a 0 and a l stated in the defini- 
tion in order that each of the coefficients in the adjoint equation (11.111) be continu- 
ous on a < t < b. 
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► Example 11.21 

Consider 


d 2 x „ dx 


-b It — — h 8x — 0. 


dt 2 dt 


Here a 0 (t) = t 2 , aft) = It, a 2 (t) = 8. By (11.111), the adjoint equation to this equa- 
tion is 


or simply 


+ [4r - 7f] d f + [2 - 7 + 8]x = 0 


d 2 x „ dx 


dt 2 


dt 


The adjoint equation of the adjoint equation of Equation (11.110) is always the 
original equation (11.110) itself (see Exercise 2 at the end of this section). We now turn 
to the special situation in which the adjoint equation (11.111) of Equation (11.110) is 
also Equation (11.110) itself. 


DEFINITION 

The second-order homogeneous linear differential equation 

d 2 x dx 

a 0 (t) 1 p- + a l (t)— + a 2 (t)x = 0 (11.110) 

is called self-adjoint if it is identical with its adjoint Equation (11.111). 


THEOREM 11.33 

Consider the second-order linear differential equation 

a o(l)^T + + «2(t)^ = 0, (11.110) 

where a 0 has a continuous second derivative, a l has a continuous first derivative, a 2 is 
continuous, and a 0 (t) #0 on a < t < b. A necessary and sufficient condition that 
Equation ( 11.110 ) be self-adjoint is that 

^[«o(f)]=«i (t) (11-112) 


on a < t < b. 


Proof. By the definition, the adjoint equation to Equation (1 1.110) is 
d 2 x dx 

a o(t)~^T + l>o« - «i(t)] Yt + [a ° W “ + a ^ X = °- ( 1U11 > 
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If condition (1 1.1 12) is satisfied, then 

2a' 0 (t) - aft) = a^f) 
and 

am - a',(t) + aft ) = a 2 (t). 

These relations show that Equations (11.110) and (11.111) are identical; that is, 
equation (11.110) is self-adjoint. 

Conversely, if Equations (11.110) and (1 1.1 1 1) are identical then 

2a’ 0 (t) - aft) = aft) 
and 

a'o(t) - a\(t) + a 2 (t) = a 2 (t). 

The second of these conditions shows that a' 0 (t) = a t (t) + c, where c is an arbitrary 
constant. From the first condition we see that a' 0 (t) = a t (t). Thus c = 0 and we have the 
condition (11.112). Q.E.D. 


COROLLARY 

Hypothesis. Suppose the second-order equation 

d^x dx 

«o(t) -^2 + + a 2 (t) X = 0 


( 11 . 110 ) 


is self-adjoint. 


Conclusion. Then Equation ( 11.110 ) can be written in the form 


d_ 

It 


dx 

aM dt\ 


+ a 2 (t)x = 0. 


(11.113) 


Proof. Since Equation (11.110) is self-adjoint, condition (11.112) is satisfied. Thus 
Equation (11.110) may be written 


/ v d 2 x t v dx 
«o(0 + a 0 (t) — + a 2 (t)x = 0 


or 


d_ 

di 


/ , dx 

[ aM lu. 


+ a 2 (t)x = 0. 


Q.E.D. 


► Example 1 1 .22 Legendre's equation 


d 2 x „ dx 

(1 - t 2 )-^-- 2r — + n(n + l)x = 0 

is self-adjoint, for a 0 (t) = l — t 2 , aft) = —2 1 and (d/dt)( 1 — t 2 ) = —2 1 . Written in the 
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form (11.113), it is 


d_ 

It 


(1 - t 2 ) 


dx 

It 


+ n(n + l)x = 0 . 


THEOREM 11.34 


Hypothesis. The coefficients a 0 , a l5 and a 2 in the differential equation 

d 2 x , . dx 

H + aM Tt 


a o(t) — t T + «i(0 37 + MO* = 0 


( 11 . 110 ) 


are continuous on a < t < b, and a 0 (t) # 0 on a < t < b. 


Conclusion. Equation (11.110) can be transformed into the equivalent self-adjoint 
equation 

d_ 
dt 

on a < t < b, where 


[<] 


+ Q(t)x = o 


(11.114) 


P(t) = expT [-4^ ^1» ^^^exp^N^r], (11.115) 

LJ a 0 {t) a 0 (t) |_J a 0 (t) J 


by multiplication throughout by the factor 

1 


a 0 (t) 


r r«i( f ) i 

exp — -{dt . 

LJ a o ( t ) 


(11.116) 


Proof. M ultiplying Equation (11.110) through by the factor ( 1 1 . 1 1 6) we obtain 

’ai(f) 


exp 


ja 0 (t) J dt 2 a 0 (t) [J a o (0 J 


dx a 2 (t) 

+ -^77 exp 


dt a 0 (t) 

This equation is clearly self-adjoint and may be written as 


dt 


exp 


J flo (0 


dx) a 2 (t) 

+ ^exp 
dt j a 0 (t) 


_J a 0 (t) 


dt 


«o (0 


x = 0 


dt 


x = 0 . 


or 


R] 


j t \m-i7\ + Q(t)x = o, 


where P and Q are given by (1 1.1 15). 


Q.E.D. 


► Example 1 1 .23 

Consider the equation 


d 2 x 


dx 


t 2 — r - 2t — + 2x = 0. 


dt 


dt 


(11.117) 
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Here a 0 (t) = t 2 , a t {t) = -2 1, a 2 (t) = 2. Since a' 0 (t) = 2 t # -2 1 = a t (t), Equation 
(11.117) is not self-adjoint. Let us form the factor (11.116) for this equation. We have 


1 

a 0 {t) 





Multiplying Equation (11. 11 7) by 1/t 4 on any interval a < t < b which does not include 
t = 0, we obtain 


1 d 2 x 2 dx 2 _ 

l I lv r ~?^ + 7 x = 0 ' 


Since 



2 


t 3 ' 


this equation is self-adjoint and may be written in the form 


d 1 dx 
dt t 2 dt 


+ X ~ °' 


B. Some Basic Results of Sturm Theory 


Henceforth we write the self-adjoint second-order equation in the form 


d_ 

dt 


, dX 
P( '»¥ 


+ Q(t)* = 


(11.118) 


where P has a continuous derivative, Q is continuous, and P(t) is > 0 on a < t < b. 

In order to obtain our first result we shall need a well-known theorem on point sets 
which is known as the Bolzano-Weierstrass theorem. Suppose £ is a set of points on 
the t axis. A point t 0 is called a limit point of E if there exists a sequence of distinct points 
1 1 , t 2 , t 3 , . . . of E such that lim t n = t 0 . The Bolzano-Weierstrass theorem states that 

n-* oo 

every bounded infinite set E has at least one limit point. 

We now proceed to study Equation (11.118). 


THEOREM 11.35 


Hypothesis 


/. Let f be a solution of 


d_ 

It 


Pit ) 


dx 

Tt_ 


+ Q(t)x = 0, 


having first derivative f' on a < t < b. 

2. Suppose f has an infinite number of zeros on a < t < b. 


(11.118) 


Conclusion. Then f(t) = 0 for all t on a < t < b. 
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Proof. Since / has an infinite number of zeros on [a, b], by the Bolzano- 
Weierstrass theorem the set of zeros has a limit point t 0 , where t 0 e [a, b], Thus there 
exists a sequence {t n } of zeros which converges to t 0 (where t n # t 0 ). Since / is 
continuous, lim f(t) = f(t 0 ), where t -► t 0 through any sequence of points on [ a , b]. 

t->to 

Let t -► t 0 through the sequence of zeros {t n }. Then 


lim f(t) = 0 = /(t 0 ). 


Now since f'(t 0 ) exists, 


t->t 0 l — l o 


where t -► r 0 through the sequence {t n }. For such points 

— UlO! = o and thus /'(r 0 ) = 0. 

t — t o 

Thus / is a solution of Equation (11.118) such that f(t 0 ) = f'(t 0 ) = 0. Hence by the 
Corollary to Theorem 1 1.16 we conclude that 


f(t) = 0 for all t on a < t < b. 


Q.E.D 


THEOREM 11.36 Abel's Formula 
Hypothesis. Let f and g be any two solutions of 

j t Pit ) d f +G(0* = o (11.118) 

on the interval a < t < b. 

Conclusion. Then for all t on a < t < b, 

P(t)LAt)gXt)-r(t)g(t)^ = K (11.119) 

where k is a constant. 

Proof. Since / and g are solutions of (1 1.1 18) on a < t < b, we have 

j t lP(t)f'(t)-] + Q(t)f(t) = 0 (11.120) 

and 

j t LPM(t)l + Qmt) = 0 (11.121) 

for all t e [a, ft]. Multiply (11.120) by -g(t) and (11.121) by f(t) and add to obtain 
f(t)j t [P(t)gW - g(t) j t LP(t)fW = o. 


( 11 . 122 ) 
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Now integrate both sides of (11.122) from a to t. Using integration by parts, we 
obtain 


f(s)P(s)g'(s) 



P(s)g'(s)f'(s) ds - g(s)P(s)f'(s) 



P(s)f'(s)g'(s) ds = 0, 


which reduces to 

P(t)U(t)g'(t) - f\t)g(t)-] = P(a)lf(a)g'(a) - f'(a)g(a)l 
The right member is a constant fc, and thus we have Abel’s formula (11.119). 


Q.E.D. 


THEOREM 11.37 


A. Hypothesis. Let f and g be two solutions of 

dx 


d 

It 


Pit)— +Q(t)x = 0 


such that f and g have a common zero on a < t < b. 


(11.118) 


Conclusion. Then f and g are linearly dependent on a < t < b. 

B. Hypothesis. Let f and g be nontrivial linearly dependent solutions of Equation 
(11.118) on a < t < b, and suppose f(t 0 ) = 0 , where t 0 is such that a < t 0 < b. 


Conclusion. Then g(t 0 ) = 0. 

Proof 

A. We apply Abel’s formula 

P{t)m)g\t)-f\t)gm^k. (11.119) 

Let t 0 e [a, b ] be the common zero of / and g. Letting t = t 0 in the formula (1 1.1 19) we 
obtain k = 0. Thus 

- f\t)g(t )] = 0 for all t e [a, b\ 

Since we have assumed throughout that P(t) > 0 on a < t < b, the quantity in brackets 
above must be zero for all t on a < t < b. But this quantity is W(f g)(t). Thus by the 
corollary to Theorem 1 1.21, the solutions / and g are linearly dependent on a < t < b. 

B. Since / and g are linearly dependent on a < t < b, there exist constants c x and c 2 , 
not both zero, such that 


cj(t) + c 2 g(t) = 0, (11.123) 

for all r on a < t < b. Now by hypothesis neither / nor g is zero for all t on a < t < b. If 
c x = 0, then c 2 g(t) = 0 for all t on a < t < b. Since g is not zero for all t on [a, fc], we 
must have c 2 = 0, which is a contradiction. Thus c x # 0, and likewise c 2 # 0. Thus 
neither c x nor c 2 in (11.123) is zero. Since f(t 0 ) = 0, letting t = t 0 in (11.123), we have 
C 2 d(to) = 0. Thus g(t 0 ) = 0. Q.E.D. 
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► Example 1 1 .24 


The equation 



is of the type (11.118), where P(t) = Q(t) = 1 on every interval a < t < b. The linearly 
dependent solutions A sin t and B sin t have the common zeros t = ±nn (n = 
0, 1,2,...) and no other zeros. 


C. The Separation and Comparison Theorems 


THEOREM 11.38 Sturm Separation Theorem 
Hypothesis. Let f and g be real linearly independent solutions of 

£[*>«)§] + em* = ° dins) 

on the interval a < t < b. 

Conclusion. Between any two consecutive zeros of f there is precisely one zero of g. 

Proof. Let t l and t 2 be two consecutive zeros of / on [a, b~\. Then by Theo- 
rem 11.37, Part A, we know that g(t i) / 0 and g(t 2 ) # 0. Now assume that g has no 
zero in the open interval t x < t < t 2 . Then since the solutions / and g have con- 
tinuous derivatives on [a, &], the quotient f /g has a continuous derivative on the 
interval t l < t < t 2 . Further, f(t)/g(t) is zero at the endpoints of this interval. Thus by 
Rolle’s theorem there exists £, where t x < £ <t 2 , such that 



and thus since / and g are linearly independent on a < t < b, 

#0 on t l <t <t 2 . 

This contradiction shows that g has at least one zero in t t < t < t 2 . 

Now suppose g has more than one zero in t t < t < t 2 , and let t 3 and be two such 
consecutive zeros of g. Then interchanging / and g, the preceding paragraph shows 
that / must have at least one zero t 5 in the open interval t 3 < t < t 4 . Then t t < t 5 < t 2 , 
and so t l and t 2 would not be consecutive zeros of /, contrary to our assumption 
concerning t t and t 2 . Thus g has precisely one zero in the open interval t t < t < t 2 . 

Q.E.D. 


d_\m 

dt |_0(f) 
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We may restate Theorem 1 1.38 in the following form: The zeros of one of two real 
linearly independent solutions of Equation (1 1.1 18) separate the zeros of the other 
solution (see Figure 11.1). 


► Example 1 1 .25 


We have already observed that the equation 



= 0 


is of the type (11.118). The functions / and g defined, respectively, by f(t) = sin t and 
g(t) = cos t are linearly independent solutions of this equation. Between any two 
consecutive zeros of one of these two linearly independent solutions there is indeed 
precisely one zero of the other solution. 


THEOREM 11.39 Sturm's Fundamental Comparison Theorem 


Hypothesis. On the interval a < t < b, 


1. Let <j) x be a real solution of 


d_ 

It 



+ Qi(t) x — 0 . 


(11.124) 


2. Let 02 be a real solution of 


d_ 

It 


x dx 

Pm lu 


+ Qi{t)x = o. 


(11.125) 


3. Let P have a continuous derivative and be such that P(t) > 0, and let Q t and Q 2 be 
continuous and such that Q 2 (t ) > Qft). 


Conclusion. If t x and t 2 are successive zeros of (j) i on [ a , fc], then <f 2 has at least 
one zero at some point of the open interval t t < t < t 2 . 

Proof. Assume that 0 2 does not have a zero on the open interval t l < t < t 2 . Then 
without loss in generality we can assume that (frft) > Oand0 2 (O > Oon^ < t < t 2 . By 
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hypothesis, we have 

^[Wi(0] + e,w^(i) = o, (hi26) 

j t iPimm + 62(0*2(0 = 0, (n.127) 

for all t e [a, b]. Multiply (11.126) by 4> 2 (t) and (11.127) by <Mf), and subtract, to obtain 
4>2 (t)j t iP(t)<l>\(t)-] - ^(t)j t iP(tW 2 {m = [e 2 (t) - QtMfaMiit). (11.128) 


Since 

4> 2 (t) J t LP(tWi(tn - = J t {P(t)l<P\(t)4>2(t) - 

the identity (11.128) reduces to 

j t {p(tu\(t) 4 > 2 (t) - = [62(f) - 6i(t)]<Mt)</>2(f). 


We now integrate this from t Y to t 2 to obtain 


' 2 j t {pm\(t)4> 2 (t) - utmm * = 


[62(f)- Qi(t)]<t>i(t)<P 2 (t)dt 


or 


pm'iMiit) - 0i(t)^2(t)] 


<2 ro 

ii J< 1 


[62(0 - 6i(t)]0i(t)02(r) ^t- (11-129) 


Since ^(M = 0i(f 2) = 0, the equality (11.129) becomes 

PihWiMfaih) - P(tiWi(h)<l> 2 {ti) = [62(f) - 6i(f)]<Mf)tf>2(t) dt. (11.130) 

By hypothesis, P(t 2 ) > 0. Since (f>i{t 2 ) = ® anc ^ > 0 on t x < t < t 2 , we 
have (t>\{t 2 )<0. Since 0 2 (f) > 0 on t l <t<t 2 , we have 0 2 (f 2 ) > 0. Thus 
PttiWdhWiih) ^ 0- In like manner, we have F(ti)0i( f i)02( f i) ^ 0- Thus, the left 
member of (1 1.130) is not positive. 

But by hypothesis Q 2 (f) _ 6i(0 > 0 on t Y < t < t 2 , and so the right member of 
(11.130) is positive. Thus the assumption that 0 2 does not have a zero on the open 
interval t t < t < t 2 leads to a contradiction, and so 0 2 has a zero at some point of this 
open interval. Q.E.D. 


As a particular case of importance, suppose that the hypotheses of Theorem 11.39 
are satisfied and that t Y is a zero of both (j) l and </> 2 . Then if t 2 and £ are the “next” zeros 
of </>! and 0 2 , respectively, we must have £ < t 2 (see Figure 1 1.2). 


d 2 x y ~ 
+ A 2 x = 0 
dt 2 


► Example 1 1 .26 

Consider the equations 


(11.131) 
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Figure 1 1 .2 


and 


d 2 x _ 2 

w + x = ’ 


(11.132) 


where A and B are constants such that B > A > 0. The functions c/q and <p 2 defined 
respectively by <t> t (t) = sin At and (f> 2 (t ) = sin Bt are real solutions of these respective 
equations. Consecutive zeros of sin At are 


m t 

~A 


and 


(n + \)n 
A 


(n = 0, ± 1, +2,...). 


Theorem 1 1.39 applies, and so we are assured that sin Bt has at least one zero such 
that 


nn r (n + l)ir 
~A <( " < A 


(n = 0, ± 1, ±2,. 


In particular, t = 0 is a zero of both sin At and sin Bt. The “next” zero of sin At is n/A, 
while the “next” zero of sin Bt is n/B\ and clearly n/B < n/A. 


Exercises 


1. Find the adjoint equation to each of the following equations: 

/ v 7 d 2 x _ dx 

(a) t 2 —y + 3t — + 3x = 0. 

; dt 2 dt 

(b) (2t + l)0 + t3^ + x = O. 

(c) t 2 + (2r 3 + 7t) ^ + (8t 2 + 8)x = 0. 

at dt 

(d) t 3d ^-(t 2 + 2t 2 -t)^ + (t 2 + t-\)x = 0. 
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2. Show that the adjoint equation of the adjoint equation (11.11 1) of the equation 

(11.110) 


d 2 x 

IT 2 


dx 

dt 


«o(0 ~ t 2 + ^7 + a 2 (t) X = 0 


is the original equation (11.110) itself. 

3. Show that each of the following equations is self-adjoint and write each in the 
form (11.1 13). 

, „ , d 2 x „ , 4x 

(a) t 3 7T + 3t 2 — + x = 0. 
dt 2 dt 


(c) 


d 2 x dx 

77 + COS t — 
dt 2 dt 

t + l\ d 2 x 1 dx 1 


(b) sin t -jy + cos t — + 2x = 0. 


t 


dt 2 


t 2 dt 


+ tX = 0. 


4. Transform each of the following equations into an equivalent self-adjoint 
equation: 

. . , d 2 x dx 

(a) ' dF + '7i + x -° 

... h d 2 x , , dx 

(b) (t + t ) 2 — I - 2t — — h 3x = 0. 

dt 2 dt 

, d 2 x dx 

(c) -7-5- - tan t — + x = 0. 

dt 2 dt 


( d ) /(O^T + 0(O^ = O- 


5. (a) A first-order differential equation of the form 

dx 


dt 


+ a(t)x + b(t)x 2 + c(t) = 0 


is called a Riccati equation. Show that the transformation 

x dx 


u = - 


transforms the self-adjoint second-order equation 

dx 


d_ 

dt 


P{t)-- + Q(t)x = 0 


dt 


into the special Riccati equation 


du 1 9 x 

Ji + nD u+ m " 
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(b) Use the result of part (a) to transform the self-adjoint equation 

s['§] +|1 -' ,5 ‘-° (A) 

into a Riccati equation. For this Riccati equation find a solution of the 
form ct n . Then employ the transformation of part (a) to find a solution of 
Equation (A). Finally, use the method of reduction of order to obtain the 
general solution of Equation (A). 

6. Use the Sturm separation theorem to show that between any two consecutive 
zeros of sin 2 1 + cos 2 1 there is precisely one zero of sin It — cos It. 

7. (a) Show that every real solution of 

W + {t + i )x = 0 

has an infinite number of positive zeros. 

(b) More generally, show that if q is continuous and such that q(t) > 0 for t > 0 
and k is a positive constant, then every real solution of 

+ k 2 ]* = 0 

has an infinite number of positive zeros. 

8. Show that if q is continuous and q(t) < 0 throughout a < t < b, then no nontrivial 
solution of 

d^x 

-j-2 + 0 ( 0 * = 0 

can have more than one zero on a < t < b. 

9. Consider the equation 

d 2 x 

— + q(t)x = 0, (A) 

where q is continuous on a < t < b and such that 0 < m < q(t) < M. Let 0! 
be a solution of Equation (A) having consecutive zeros at t t and t 2 (where 
a < t t < t 2 < b). Show that 


Jm 


< 1 7 


, d 2 x 
1? 


[Hint: Consider a solution 0 2 of =r + Mx = 0 which is such that 0 2 (U) = 0, 


and apply Theorem 11.39.] 

10. Consider the differential equation 

d 2 x 

Tt 


2 + 0 ( 0 * = 0 , 


(A) 


where q is continuous and q(t) > 0 on a < t < b. Let q m denote the minimum 
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value of q{t) on a < t < b. Show that if q m > k 2 n 2 /(b — a) 2 , then every real 
solution of Equation (A) has at least k zeros on a < t < b. 


[Hint: Consider the zeros of the real solutions of 

// 2 y U 2 tt 2 

x = 0 


d 2 x k 2 n 2 

li T + (b 


af 


and apply Theorem 11.39.] 

11. Consider the Bessel differential equation 



+ t - + ( t 2 -p 2 )X = °, 


(A) 


where p is a parameter. Suppose t > 0; and let [a, a + 7t], where a > 0, be an 
arbitrary interval of length n on the positive t axis. Prove the following result: 

If p = 0, every such interval [a, a + n] contains at least one zero of any 
solution of the Bessel differential equation (A) of order zero; and if p > j, every 
such interval [ a , a + 7t] contains at most one zero of any nontrivial solution of 
the Bessel differential equation of order p. 

[Hint: According to the result stated in Exercise 2 of Section 6.3 the trans- 
formation 


y 


u(t) 




reduces Equation (A) to the form 






u = 0 


or, equivalently, 




4 p 2 - 1 
At 2 




u = 0. 


(B) 


Note that if a solution / of an equation of the form (B) has a zero at t 0 > 0, then 
the solution g such that g(t) = f(t)/yft of the corresponding Bessel equation (A) 
also has a zero at t 0 . Then apply Theorem 1 1.39 to compare the number of zeros 
of solutions of Equation (B) to the number of zeros of solutions of 


d 2 u 

Tt 


U u 

+ u = 0.] 


12. Consider the differential equations 


and 



+ Q i(0* = 0 


+ 6 2 (0* - 


(A) 

(B) 


where P has a continuous derivative and is such that P(t) > 0, and Q t and Q 2 are 
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continuous and such that Q 2 (t) > Qi(t) on a < t < b. Let t 0 and t t be such that 
a < t 0 < t t < b. Let 0 1 be the solution of Equation (A) such that <Mt 0 ) = c o> 
(j>\(t 0 ) = c l9 and let </> 2 be the solution of Equation (B) such that (j) 2 (t 0 ) = c o 
and <p' 2 (t 0 ) = where c 0 > 0 and > 0 if c 0 = 0. Suppose that (j) 2 (t) > 0 for 
t 0 < t < t v 

Show that (j )^ ) > (j) 2 (t ) for t 0 < t < 

[Hint: Proceed as in the proof of Theorem 1 1.39 to show that 

P(t)Wl(t)4>2(t) - <MO02(O] = P IQi(s) - Gl(s)D0i(s)0 2 (s) ds 

Jto 

for all t e [a, b~\. Thus show that the function h such that h(t) = (t>i{t)/(t> 2 (t) is 
increasing for t 0 < t < t l9 and observe the value ft(t 0 ).] 



CHAPTER TWELVE 

Sturm- Liouville Boundary - Value Problems and 
Fourier Series 


In Chapter 1 we encountered boundary-value problems consisting of a second-order 
linear differential equation and two supplementary conditions which the solution of 
the equation must satisfy. In this chapter we shall consider a special kind of boundary- 
value problem known as a Sturm- Liouville problem. Our study of this type of problem 
will introduce us to several important concepts including characteristic function , 
orthogonality, and Fourier series. These concepts are frequently employed in the 
applications of differential equations to physics and engineering. In Chapter 14 we 
shall use them to obtain solutions of boundary-value problems which involve partial 
differential equations. 


1 2.1 STURM— LIOUVILLE PROBLEMS 
A. Definition and Examples 

Our first concern in this chapter is a study of the special type of two-point boundary- 
value problem given in the following definition: 

DEFINITION 

We consider a boundary-value problem which consists of 
1. a second-order homogeneous linear differential equation of the form 

lx [ p(x * + [«(*) + Mx)]y = °. ( ' 1 2. i ) 

where p, q, and r are real functions such that p has a continuous derivative, q and r are 
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continuous , and p(x) > 0 and r(x) > 0 for all x on a real interval a < x <b; and X is a 
parameter independent of x ; and 
2. two supplementary conditions 

A I y(a) + A 2 y'(a ) = 0, 

B l m + B 2 Ab) = 0 , (112) 

where A l9 A 2 , B t , and B 2 are real constants such that A i and A 2 are not both zero and B t 
and B 2 are not both zero. 

This type of boundary-value problem is called a Sturm-Liouville problem (or Sturm- 
Liouville system). 

Two important special cases are those in which the supplementary conditions (12.2) 
are either of the form 


y{a) = 0, 

y(b) = 0 

(12.3) 

or of the form 



y'(a) = 0, 

y'(b) = 0. 

(12.4) 

► Example 12.1 



The boundary-value problem 



d 2 y 

dS +Xy 

= 0, 

(12.5) 

m = o, 

y(n) = 0 

(12.6) 


is a Sturm-Liouville problem. The differential equation (12.5) may be written 

^[ 1 '^] + [0 + ^' 1]3; = 0 

and hence is of the form (12.1), where p(x) = 1, q(x) = 0, and r(x) = 1. The 
supplementary conditions (12.6) are of the special form (12.3) of (12.2). 


► Example 12.2 


The boundary-value problem 

+ [2x 2 + Xx 3 ]y = 0, 

3y(l) + 4/(1) = 0, 

5/2) - 3/(2) = 0, 

is a Sturm-Liouville problem. The differential equation (12.7) is of the form (12.1), 
where p(x) = x, q(x) = lx 2 , and r(x) = x 3 . The conditions (12.8) are of the form (12.2), 
where a = 1, b = 2, A t = 3, A 2 = 4, B i = 5, and B 2 = —3. 


d dy 
dx X dx_ 


(12.7) 

(12.8) 
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Let us now see what is involved in solving a Sturm-Liouville problem. We must find 
a function / which satisfies both the differential equation (12.1) and the two sup- 
plementary conditions (12.2). Clearly one solution of any problem of this type is the 
trivial solution </> such that </>(x ) = 0 for all values of x. Equally clear is the fact that this 
trivial solution is not very useful. We shall therefore focus our attention on the seareh 
for nontrivial solutions of the problem. That is, we shall attempt to find functions, not 
identically zero , which satisfy both the differential equation (12.1) and the two 
conditions (12.2). We shall see that the existence of such nontrivial solutions depends 
upon the value of the parameter X in the differential equation (12.1). To illustrate this, 
let us return to the Sturm-Liouville problem of Example 12.1 and attempt to find 
nontrivial solutions. 


► Example 12.3 

Find nontrivial solutions of the Sturm-Liouville problem 

d 2 y 

+ Ay = 0, (12.5) 

y( 0) = 0, y(n) = 0. (12.6) 

Solution. We shall consider separately the three cases X = 0, X < 0, and X > 0. In 
each case we shall first find the general solution of the differential equation (12.5). We 
shall then attempt to determine the two arbitrary constants in this solution so that the 
supplementary conditions (12.6) will also be satisfied. 

Case 1 : X = 0. In this case the differential equation (12.5) reduces at once to 


and so the general solution is 

y = c l + c 2 x. (12.9) 

We now apply the conditions (12.6) to the solution (12.9). Applying the first condition 
y(0) = 0, we obtain c i = 0. Applying the second condition y(n) = 0, we find that 
+ c 2 n = 0. Hence, since c Y = 0, we must also have c 2 = 0. Thus ip order for the 
solution (12.9) to satisfy the conditions (12.6), we must have c x = c 2 = 0. But then the 
solution (12.9) becomes the solution y such that y(x) = 0 for all values of x. Thus if 
the parameter X = 0, the only solution of the given problem is the trivial solution. 

Case 2: A <0. The auxiliary equ ation of the differential equation (12.5) is 
m 2 + X = 0 and has the roots ± y/ — X. Since in this case X - 0, these roots are real and 
unequal. Denoting yf—X by a, we see that for X < 0 the general solution of (12.5) is of 
the form 

y = c t e ax + c 2 e~ ax . (12.10) 

We now apply the conditions ( 1 2.6) to the solution ( 1 2. 1 0). Applying the first condition 
y(0) = 0, we obtain 


Ci + c 2 = 0. 


(12.11) 
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Applying the second condition y(n) = 0, we find that 

+ c 2 e~ an = 0. (12.12) 

We must thus determine c t and c 2 such that the system consisting of (12.1 1) and (12.12) 
is satisfied. Thus in order for the solution (12.10) to satisfy the conditions (12.6), the 
constants c x and c 2 must satisfy the system of Equations (12.1 1) and (12.12). Obviously 
c i = c 2 = 0 is a solution of this system; but these values of c t and c 2 would only give 
the trivial solution of the given problem. We must therefore seek nonzero values of c x 
and c 2 which satisfy (12.11) and (12.12). By Chapter 7, Section 7.5C, Theorem A, this 
system has nonzero solutions only if the determinant of coefficients is zero. Therefore 
we must have 


But this implies that e an = e~ CLK and hence that a = 0. Thus in order for a nontrivial 
function of the form (12.10) to satisfy the conditions (12.6) we must have a = 0. Since 
a = X, we must then have X = 0. But X < 0 in this case. Thus there are no non- 
trivial solutions of the given problem in the case X < 0. 

Case 3: X > 0. Since X > 0 here, the roots ±yj — X of the auxiliary equation of 
(12.5) are the conjugate-complex numbers ± y/Xi. Thus in this case the general solution 
of (12.5) is of the form 

y = Ci sin + c 2 coSy/Xx. (12.13) 

We now apply the conditions (12.6) to this general solution. Applying the first 
condition y(0) = 0, we obtain 

c i sin 0 + c 2 cos 0 = 0 

and hence c 2 = 0. Applying the second condition y(n) = 0, we find that 

c x sitiy/Xn + c 2 coSy/Xn = 0. 

Since c 2 = 0, this reduces at once to 

c Y smy/~Xn = 0 (12.14) 

We must therefore satisfy (12.14). At first glance it appears that we can do this in either 
of two ways: we can set c i = 0 or we can set sinyfXn = 0. However, if we set c l = 0, 
then (since c 2 = 0 also) the solution ( 1 2. 1 3) reduces immediately to the unwanted trivial 
solution. Thus to obtain a nontrivial solution we can not set c l = 0 but rather we must 
set 

sin^/lTt = 0. (12.15) 

If k > 0, then sin kn = 0 only if k is a positive integer n = 1, 2, 3, ... . Thus in order to 
satisfy (12.15), we must have y/X = n, where n = 1, 2, 3, ... . Therefore, in order that the 
differential equation (12.5) have a nontrivial solution of the form (12.13) satisfying the 
conditions (12.6), we must have 

X = n 2 , where n = 1, 2, 3,.... (12.16) 

In other words, the parameter X in (12.5) must be a member of the infinite sequence 

1,4,9, 16,..., n 2 ,.... 
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Summary. If A < 0, the Sturm-Liouville problem consisting of (12.5) and (12.6) 
does not have a nontrivial solution; if A > 0, a nontrivial solution can exist only if A is 
one of the values given by ( 1 2. 1 6). We now note that if A is one of the values ( 1 2. 1 6), then 
the problem does have nontrivial solutions. Indeed, from (12.13) we see that nontrivial 
solutions corresponding to A = n 2 (n = 1, 2, 3, . . .) are given by 

y = c n sin nx (n = 1,2,3,...), (12.17) 

where c n (n = 1, 2, 3, . . .) is an arbitrary nonzero constant. That is, the functions defined 
by c t sin x, c 2 sin 2x, c 3 sin 3x, . . . , where c l5 c 2 , c 3 , . . . are arbitrary nonzero constants, 
are nontrivial solutions of the given problem. 


B. Characteristic Values and Characteristic Functions 

Example 12.3 shows that the existence of nontrivial solutions of a Sturm-Liouville 
problem does indeed depend upon the value of the parameter A in the differential 
equation of the problem. Those values of the parameter for which nontrivial solutions 
do exist, as well as the corresponding nontrivial solutions themselves, are singled out by 
the following definition: 


DEFINITION 

Consider the Sturm-Liouville problem consisting of the differential equation (12.1) and 
the supplementary conditions (12.2). The values of the parameter A in (12.1) for which 
there exist nontrivial solutions of the problem are called the characteristic values of the 
problem. The corresponding nontrivial solutions themselves are called the characteristic 
functions of the problem .* 


► Example 12.4 

Consider again the Sturm-Liouville problem 

d 2 y 

(12.5) 

y( 0) = 0, y(n) = 0. (12.6) 

In Example 12.3 we found that the values of A in (12.5) for which there exist nontrivial 
solutions of this problem are the values 

A = n 2 , where n = 1, 2, 3, .... (12.16) 

These then are the characteristic values of the problem under consideration. The 
characteristic functions of the problem are the corresponding nontrivial solutions 

y = c n sinnx (n = 1,2,3,...), (12.17) 

where c n (n = 1, 2, 3, . . .) is an arbitrary nonzero constant. 


The characteristic values are also called eigenvalues ; and the characteristic functions are also called 
eigenfunctions. 
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► Example 12.5 


Find the characteristic values and characteristic functions of the Sturm-Liouville 
problem 


d_ 

dx 



+ - y = 0. 
x 


/(l) = 0, y\e 2 *) = 0, 


(12.18) 

(12.19) 


where we assume that the parameter A in (12.18) is nonnegative. 


Solution. We consider separately the cases X = 0 and X > 0. If X = 0, the 
differential equation (12.18) reduces to 


d dy 
dx X dx 


= 0 . 


The general solution of this differential equation is 

y = C In |x| + C 0 , 

where C and C 0 are arbitrary constants. If we apply the conditions (12.19) to this 
general solution, we find that both of them require that C = 0 but neither of them 
imposes any restriction upon C 0 . Thus for X = 0 we obtain the solutions y = C 0 , where 
C 0 is an arbitrary constant. These are nontrivial solutions for all choices of C 0 / 0. 
Thus X = 0 is a characteristic value and the corresponding characteristic functions are 
given by y = C 0 , where C 0 is an arbitrary nonzero constant. 

If X > 0, we see that for x / 0 this equation is equivalent to the Cauchy-Euler 
equation 

x 2 + x^~ + Xy = 0. (12.20) 

dx dx 


Letting x = e\ Equation (12.20) transforms into 

d 2 y 

+ Xy = 0. (12.21) 

Since X > 0, the general solution of (12.21) is of the form 

y = c { sin + C 2 COS y/Xt. 

Thus for X > 0 and x > 0 the general solution of (12.18) may be written 

y = Ci sin( > /I In x) + c 2 cos ( X /A In x). (12.22) 

We now apply the supplementary conditions (12.19). From (12.22) we find that 


dy_ _ c 2x /I 
dx x 


cos 




!\/^ 


sin 


(v/A In x) 


(12.23) 


.for x > 0. Applying the first condition /(!) = 0 of (12.19) to (12.23), we have 


c \ \[^ cos( N /I In 1) — c 2 y/J. sin( x /X In 1) = 0 
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or simply c ly /X = 0. Thus we must have 

c x = 0. (12.24) 

Applying the second condition y\e 2n ) = 0 of (12.19) to (12.23), we obtain 
c ly /Xe~ 2n cos^X In e 2n ) — c 2y J~Xe~ ln sin [J~X In e 2n ) = 0. 

Since c i = 0 by (12.24) and In e 2n = 2n, this reduces at once to 

c 2y JXe~ 2n sin (In^fX) = 0. 

Since c Y = 0, the choice c 2 = 0 would lead to the trivial solution. 

We must have sin (In^fX ) = 0 and hence 2n^/~X = nn , where n = 1, 2, 3, ... . Thus in 
order to satisfy the second condition (12.19) nontrivially we must have 

X = ^- («= 1,2,3,...). (12.25) 

Corresponding to these values of X we obtain for x > 0 the nontrivial solutions 

>, = C„cos^~) <w= 1,2,3,...), (12.26) 


where the C n (n = 1, 2, 3, . . .) are arbitrary nonzero constants. 
Thus the values 


1 9 25 w* 

A U, , 1 , , 5 A > • • • 9 A J • • • 5 

4 4 4 4 


given by (12.25) for n > 0, are the characteristic values of the given problem. The 
functions 


C 0 , C x c ° s ( — ^ ^2 cos(ln x), C 3 cos 


given by (12.26) for n > 0, where C 0 , C l9 C 2 , C 3 ,... are arbitrary nonzero constants, 
are the corresponding characteristic functions. 

For each of the Sturm-Liouville problems of Examples 12.3 and 12.5 we found an 
infinite number of characteristic values. We observe that in each of these problems the 
infinite set of characteristic values thus found can be arranged in a monotonic 
increasing sequence* 

X x < X 2 < A 3 < 

such that X n -> + oo as n -► + oo. For example, the characteristic values of the problem 
of Example 12.3 can be arranged in the monotonic increasing sequence 

1 < 4 < 9 < 16 < ** (12.27) 

such that X n = n 2 -► + oo as + oo. We also note that in each problem there is a one- 

parameter family of characteristic functions corresponding to each characteristic 
value, and any two characteristic functions corresponding to the same characteristic 
value are merely nonzero constant multiples of each other. For example, in the problem 
of Example 12.3 the one-parameter family of characteristic functions corresponding to 
the characteristic value n 2 is c n sin nx , where c n / 0 is the parameter. 


An infinite sequence {x n } is said to be monotonic increasing if x n + 7 > x n for every n. 
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We might now inquire whether or not all Sturm-Liouville problems of the type 
under consideration possess characteristic values and characteristic functions having 
the properties noted in the preceding paragraph. We can answer in the affirmative by 
stating the following important theorem. 

THEOREM 12.1 

Hypothesis. Consider the Sturm-Liouville problem consisting of 
L the differential equation 

lx [ P ^ Jx ] + + y = °’ ( 1 2 - 1 ) 

where p , q , and r are real functions such that p has a continuous derivative , q and r are 
continuous , and p(x) > 0 and r(x) > 0 for all x on a real interval a < x < b; and X is a 
parameter independent of x ; and 
2. the conditions 


A t y(a) + A 2 y'(a) = 0, 

B iy (b) + B 2 y’(b) = 0, (112) 

where A t , A 2 , B i9 and B 2 are real constants such that A x and A 2 are not both zero and B l 
and B 2 are not both zero. 

Conclusions 

1. There exists an infinite number of characteristic values X n of the given problem. 
These characteristic values X n can be arranged in a monotonic increasing sequence 

Ai < X 2 < X 2 < 

such that X n -► + oo as n -+ + oo. 

2. Corresponding to each characteristic value X n there exists a one-parameter family 
of characteristic functions fi n . Each of these characteristic functions is defined on 
a < x < b, and any two characteristic functions corresponding to the same characteristic 
value are nonzero constant multiples of each other. 

3. Each characteristic function corresponding to the characteristic value X n (n = 
1, 2, 3, . . .) has exactly (n — 1) zeros in the open interval a < x < b. 

We regard the proof of this theorem as beyond the scope of this book; therefore we 
omit it. 

► Example 12.6 

Consider again the Sturm-Liouville problem of Examples 12.3 and 12.4, 

d 2 y 

+ ky = 0, (12.5) 

y( 0) = 0, y(n) = 0. (12.6) 

We have already noted the validity of Conclusions 1 and 2 of Theorem 12.1 for this 
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problem. The infinite number of characteristic values X n = n 2 (n = 1, 2, 3,...) can be 
arranged in the unbounded monotonic increasing sequence indicated by (12.27); and 
the characteristic functions c n sin nx(c n # 0), corresponding to X n = n 2 , possess the 
properties stated. 

We now illustrate Conclusion 3 by showing that each characteristic function 
c n sin nx corresponding to X n = n 2 has exactly (n — 1) zeros in the open interval 0 < 
x < n. We know that sin nx = 0 if and only if nx = kn , where k is an integer. Thus the 
zero of c n sin nx are given by 

kir. 

x = — (k = 0, ± 1, ±2,...). (12.28) 

n 

The zeros (12.28) which lie in the open interval 0 < x < n are precisely those for which 
k = 1, 2, 3,..., n - 1. Thus, just as Conclusion 3 asserts, each characteristic function 
c n sin nx has precisely (n — 1) zeros in the open interval 0 < x < n. 


Exercises 

Find the characteristic values and characteristic functions of each of the following 
Sturm-Liouville problems. 

1 . ^ + Xy- 0 , ,< 0 )- 0 , ,(§)-«. 

1 0 + ^ = °’ = °> /(*) = 0 . 
d 2 y 

3. — j + Xy = 0, y(0) = 0, y(L) = 0, where L > 0. 

d 2 y 

4. —y + Xy = 0, /(0) = 0, y'(L) = 0, where L > 0. 

dx 

5. + Ay = 0, y(0) = 0. y(n) - y\n) = 0. 

6 - + *y ~ °. y(0) - /(0) = 0, y(n) - y'(n ) = 0. 

7 - i[ xd £] + ^ ,=0 ’ r<,, = 0 ’ Ae ’ ,=a 

9 ■ + *°>=* 

[Hint: Let x = tan t.] 

m i[l^ri d l] + i ^ 2 + 1) y- 0 - > m ~ 0 ’ *>- a 

[Hint: Let t = x 3 + x.] 
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12.2 ORTHOGONALITY OF CHARACTERISTIC FUNCTIONS 
A. Orthogonality 

We now introduce the concept of orthogonality of functions in the following definition. 


DEFINITION 

Two functions f and g are called orthogonal with respect to the weight function r on the 
interval a < x < b if and only if 

f f(x)g(x)r(x) dx = 0. 


► Example 12.7 


The functions sin x and sin 2x are orthogonal with respect to the weight function 
having the constant value 1 on the interval 0 < x < n, for 


%7t 2 sin 3 

(sin x)(sin 2x)(l) dx = — 

o 3 


= 0 . 


DEFINITION 

Let {</>„}, n = 1,2, 3, . . . , be an infinite set of functions defined on the interval a < x < b. 
The set {</>„} is called an orthogonal system with respect to the weight function r on 
a < x < b if every two distinct functions of the set are orthogonal with respect to r on 
a < x < b. That is, the set {</>„} is orthogonal with respect to r on a < x < b if 

r b 

</> m (x)0 n (x)r(x) dx = 0 for m / n. 


► Example 12.8 


Consider the infinite set of functions {</>„}, where = sin nx(n = 1, 2, 3, . . .), on the 
interval 0 < x < n. The set {</>„} is an orthogonal system with respect to the weight 
function having the constant value 1 on the interval 0 < x < n, for 


Jo 2(m - n) 


sin(m -f n)x n 
2 (m + n) o 


for m / n. 


Note. In Examples 1 2.7 and 1 2.8 and also in all later examples in this chapter which 
involve orthogonality, the weight function r is the function having the constant value 1 
on the interval of the problem. Of course one should not conclude from this that the 



598 STURM— LIOUVILLE BOUNDARY-VALUE PROBLEMS AND FOURIER SERIES 

weight function r is always r(x) = 1 for all x under consideration in every problem 
involving orthogonality. For instance, see Exercise 3 at the end of this section and also 
the orthogonality relation involving the Bessel function of the first kind of order zero in 
Section 14.2D. 


B. Orthogonality of Characteristic Functions 

We now state and prove a basic theorem concerning the orthogonality of characteristic 
functions of a Sturm-Liouville problem. 


THEOREM 12.2 

Hypothesis. Consider the Sturm-Liouville problem consisting of 

1. the differential equation 

L p(x)^ + lq(x) + Mx)ly = o, (12.1) 

where p, q, and r are real functions such that p has a continuous derivative, q and r are 
continuous, and p(x) > 0 and r(x) > 0 for all x on a real interval a < x < b; and X is a 
parameter independent of x; and 

2. the conditions 

A l y(a) + A 2 y'(a) = 0, ^ 

Bi y(b) + B 2 y'(b) = 0, 

where A u A 2 , B l ,and B 2 are real constants such that A 1 and A 2 are not both zero and B i 
and B 2 are not both zero. 

Let X m and X n be any two distinct characteristic values of this problem. Let <j> m be a 
characteristic function corresponding to X m and let (f n be a characteristic function 
corresponding to X n . 

Conclusion. The characteristic functions </> m and <j) n are orthogonal with respect to 
the weight function r on the interval a < x < b. 

Proof. Since 0 m is a characteristic function corresponding to X m , the function 
satisfies the differential equation (12.1) with X = X m \ and since </>„ is a characteristic 
function corresponding to X n , the function </>„ satisfies the differential equation (12.1) 
with X = X n . Thus, denoting the derivatives of 0 m and by (j)' m and (p' n , respectively, we 
have 


+ D?M + 2 m r(x)]</> m (x) = 0, 

(12.29) 

f [p(x)^(x)] + [^f(x) + X„r(x)~\<p„(x) = 0, 

(12.30) 


for all x such that a < x < b. Multiplying both sides of (12.29) by 0„(x) and both sides of 
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(12.30) by 0 w (x) and then subtracting the results we obtain 

[pWmM] + L4>Jx)4>n(x)r(x) - <f>Jx) lp(x)4>' n (x)'] 


and hence 


= 0, 


d d 

U m ~ x„)<pjx)4>jx)r(x) = 4>Jx) — lp(x)4>' n (x)~\ - <p„(x) — [p(x)</»;(x)]. 


We now integrate both members of this identity from a to b to obtain 


(K 


-*r 


4>m(x)<l>„(x)r(x) dx 


■r 


<t>m(x) ^ lp(x)4>' n (x)] dx 


*b 

Ja 


4>n(x)-^lp(x)<i>'Jx)] dx. 


(12.31) 


Applying integration by parts to each integral in the right member of (12.31), this right 
member becomes 


4>m(x)p(x)<t>'„{x) 

or simply 


fc _ r 

a Ja 


p{x)d>'„(x)(t>' m (x) dx - <p„(x)p(x)(t>' m (x) 


b rb 
+ 

a Ja 


p(x)<l>' m (x)(l>'„(x) dx 


p(x) 


<t>m(xW n {x) - <t>„(x)(p 




Therefore the identity (12.31) becomes 


Ur, 




d>m(x)(t> n (x)r(x) dx = p(b)l4> m (b)4>'„(b) - 4> n (b)4>’m(b)'] 

~ P( a )L4> m (a)(t>' n (a) - <P„(a)<t> m (a)l (12.32) 


Since </> m and </>„ are characteristic functions of the problem under consideration, they 
satisfy the supplementary conditions (12.2) of the problem. If A 2 = B 2 = 0 in (12.2), 
these conditions reduce to y(a) = 0, y(b) = 0. Then in this case (j) m (a) = 0, (j) m (b) = 0, 
0„(a) = 0, and 4> n (b) = 0, and so the right member of (12.32) is equal to zero. 

If A 2 = 0 but B 2 / 0 in (12.2), these conditions reduce to y(a) = 0, j Sy(b) + y'(b) = 0, 
where = B l /B 2 . Then the second bracket in the right member of (12.32) is again equal 
to zero. Also, the first bracket in this member may then be written 


t(34> n (b) + 4>’*m 4>m(b) - [J Hm(b) + 

and so it is also equal to zero. Thus in this case the right member of (12.32) is equal to 
zero. 

In like manner, if either A 2 ^ 0, B 2 = 0 or A 2 / 0, B 2 ^ 0 in (12.2), then the right 
member of (12.32) is equal to zero. Thus in all cases the right member of (12.32) is equal 
to zero and so 

Um - K) | (pjx)(f> n (x)r{x) dx = 0. 

Since X m and X n are distinct characteristic values, their difference X m — X n ^ 0. Therefore 
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we must have 

j* 4>m(x)4>Jx)r(x) dx = 0, 

and so <j) m and </>„ are orthogonal with respect to r on a < x < b. Q.E.D. 

Let {A„} be the infinite set of characteristic values of a Sturm-Liouville problem, 
arranged in a monotonic increasing sequence < X 2 < ^3 < *• For each n = 

1, 2, 3, . . . , let (j) n be one of the characteristic functions corresponding to the character- 
istic value X n . Then Theorem 12.2 implies at once that the infinite set of characteristic 
functions (j) l , </> 2 , 0 3 ,. . . is an orthogonal system with respect to the weight function r 
on a < x < b. 


► Example 12.9 

Consider once more the Sturm-Liouville problem 

d 2 v 

^ = 0, (12.5) 

y( 0) = 0, y(n) = 0, (12.6) 

which we have already investigated in Examples 12.3, 12.4, and 12.6. Corresponding to 
each characteristic value X n = n 2 (n = 1, 2, 3, . . .) we found the characteristic functions 
c n sin nx(n = 1, 2, 3, . . .), where c n (n = 1, 2, 3, . . .) is an arbitrary nonzero constant. Let 
{<£„} denote the infinite set of characteristic functions for which c n = 1 (n = 1, 2, 3, . . .). 
That is, 

4> n (x) = sin nx (0 < x < n; n— 1, 2, 3, ... ). 

Then by Theorem 12.2, the infinite set {</>„} is an orthogonal system with respect to the 
weight function r, where r(x) = 1 for all x, on the interval 0 < x < n. That is, Theorem 
12.2 shows that 

J (sin mx)(sin nx)(l) dx = 0 (12.33) 

for m = 1, 2, 3, . . . ; n = 1, 2, 3, . . . ; m # n. We have already noted the validity of (12.33) 
in Example 12.8. 


Exercises 

Directly verify the validity of the conclusion of Theorem 12.2 for the characteristic 
functions of each of the Sturm-Liouville problems in Exercises 1, 2, and 3. 

\. 0 + Ay = 0, y(0) = 0, y(n/2) = 0. 

(See Exercise 1 at the end of Section 12.1.) 
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2 - Ly + Ay = 0, y'(0) = 0, y'(L) = 0, 

where L > 0. (See Exercise 4 at the end of Section 12.1.) 

3 - i[ xd i] + ^ y -°’ 

(See Example 12.5 of the text.) 

4. Let {P„}, n = 0, 1, 2, . . . , be an infinite set of polynomial functions such that 

1. P n is of degree n,n = 0, 1,2,..., 

2. P„(l) = 1, w = 0, 1,2,..., and 

3. the set { P n } is an orthogonal system with respect to the weight function r such 
that r(x) = 1 on the interval — 1 < x < 1. 

Construct consecutively the members P 0 , P l5 P 2 , and P 3 of this set by writing 

P 0 (x) = a 0 , 

Pi(x) = b 0 x + b u 

P 2 (x) = c 0 x 2 + c t x + c 2 , 

p 3 (x) = + d 2 x + d 3 , 

and determining the constants in each expression so that it has the value 1 at x = 1 
and is orthogonal to each of the preceding expressions with respect to r on 
-1 < x < 1. 


12.3 THE EXPANSION OF A FUNCTION IN A SERIES OF 
ORTHONORMAL FUNCTIONS 

A. Orthonormal Systems 

We proceed to introduce the concept of an orthonormal system. 

DEFINITION 

A function f is called normalized with respect to the weight function r on the interval 
a < x < b if and only if 

| lf(x)] 2 r{x)dx = 1. 

► Example 12.10 

The function / such that /(x) = yjl /n sin x is normalized with respect to the weight 
function having the constant value 1 on the interval 0 < x < n, for 
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DEFINITION 


Let {(/)„} (n = 1, 2, 3 ,...)be an infinite set of functions defined on the interval a < x < b. 
The set { } is called an orthonormal system with respect to the weight function r on 
a < x < b if (1) it is an orthogonal system with respect to r on a < x < b and (2) every 
function of the system is normalized with respect tor on a < x < b. That is, the set {(/)„} is 
orthonormal with respect to r on a < x < b if 


'b 

<l>m(x)(t>n(x)r{x) dx 

a 


0 for m / n, 

1 for m = n. 


► Example! 2. tt 


Consider the infinite set of functions {</>„}, where fi n (x) = -yjl/n sin nx (n = 1, 2, 3, . . .) 
on the interval 0 < x < n. The set {</>„} is an orthogonal system with respect to the 
weight function having the constant value 1 on the interval 0 < x < n, for 



(1) dx = 0 


for m / n. 


Further, every function of the system is normalized with respect to this weight function 
on 0 < x < 7r, for 

v 

sin nx 1 (1) dx = 1. 

Thus the set {</>„} is an orthonormal system with respect to the weight function having 
the constant value 1 on 0 < x < n. 



Now consider the Sturm-Liouville problem consisting of the differential equation 
(12.1) and the supplementary conditions (12.2). Let {A„} be the infinite set of char- 
acteristic values of this problem, arranged in a monotonic increasing sequence < 
^2 < ^3 < • • *. If (w = 1, 2, 3, . . .) is one of the characteristic functions corresponding 
to the characteristic value \ n , then we know from Theorem 12.2 that the infinite set of 
characteristic functions </> i9 </> 2 , </> 3 ,... is an orthogonal system with respect to the 
weight function r on a < x < b. But this set of characteristic functions is not necessarily 
orthonormal with respect to r on a < x < b. 

Now recall that if <£„ is one of the characteristic functions corresponding to X„, then 
k n (j) n , where k n is an arbitrary nonzero constant, is also a characteristic function 
corresponding to Thus from the given set of characteristic functions 0 1 , • • • > 

we can form a set of “new” characteristic functions k 2 (t) 2 , /c 3 0 3 ,...; and this 

“new” set is also orthogonal with respect to r on a < x < b. Now if we can choose the 
constants/c^ k 2 , /c 3 .... in such a way that every characteristic function of the “new” set 
is also normalized with respect to r on a < x < b, then the “new” set of characteristic 
functions k l fi l ,k 2 (t) 2 ,k 3 (t) 3 ,... will be an orthonormal system with respect to r on 
a < x < b. 

We now show that the constants k i9 k 2 , fe 3 , . . . can indeed be chosen so that the set 
k l (f) l ,k 2 (l) 2 ,k 3 (t) 3 ,...is orthonormal. Recall that the function r in the differential 
equation (12.1) is such that r(x) > 0 for all x on the interval a < x < b. Also recall that 
by definition no characteristic function fi n (n= 1,2,3,...) is identically zero on 
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a < x <b. Therefore 


and so 


Thus the set 


J> 


(x)] 2 r(x) dx = K„ > 0 (n= 1,2,3,...), 


fbkH’ 


r(x) dx = 1 (n =1,2,3,...)- 


: 4>U — F= 02, —7= 03, • • • 




is an orthonormal set with respect to r on a < x < b. We have thus shown that from a 
given set of orthogonal characteristic functions </> l5 <t> 2 , 03 , • • • , we can always form the 
set of orthonormal characteristic functions k i (f) l9 /c 2 </> 2 , ^ 303 > • • • > where 


k = 


y/K [<l>n{xY\ 2 r(x) dx 


(n = 1, 2, 3,...). 


(12.34) 


► Example 12.12 


The Sturm-Liouville problem consisting of the differential equation (12.5) and the 
conditions (12.6) has the set of orthogonal characteristic functions {</>„}, where 
</>„(x) = c n sin nx (n = 1, 2, 3, . . . ; 0 < x < n), and c n (n = 1, 2, 3, . . .) is a nonzero con- 
stant. We now form the sequence of orthonormal characteristic functions {k n </>„}, 
where k n is given by (12.34). We have 




-r 


(c n sin /ix) 2 (l) dx = 


cl it 



(n= 1,2, 3,...). 


Thus the Sturm-Liouville problem under consideration has the set of orthonormal 
characteristic functions were ^„(x) = yjljn sin nx (n = 1, 2, 3, . . . ; 0 < x < n). We 
observe that this is the set of orthonormal functions considered in Example 12.11. 


B. The Expansion Problem 

We now consider a problem which has many applications and has led to the 
development of a vast amount of advanced mathematical analysis. Let {</>„} be an 
orthonormal system with respect to a weight function r on an interval a < x < b, and 
let / be an “arbitrary” function. The basic problem under consideration is to expand 
the function / in an infinite series of the orthonormal functions </> 1? </> 2 , </> 3 , . . . . 
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Let us first assume that such an expansion exists. That is, we assume that 

fix) = i c„4> n (x) (12.35) 

n= 1 

for each x in the interval a < x < b. Now assuming that this expansion does exist, how 
do we determine the coefficients c n (n = 1, 2, 3,...)? Let us proceed formally for the 
moment, without considering questions of convergence. We multiply both sides of 

(12.35) by cj) k (x)r(x ), where r is the weight function and is the fcth function of the 
system {</>„}, thereby obtaining 

f(x)<p k (x)r(x) = £ c„<pjx)4) k (x)r{x). 

n= 1 

We now integrate both sides of this equation from a to b and obtain 


m b 

f(x)4> k (x)r(x) dx 

a 



4>n(x)4>k(x)r(x) 



c„<Pn(x)<l> k (x)r(x) dx, 


assuming that the integral of the sum in the middle member is the sum of the integrals 
in the right member. Under this assumption we thus have 


f 


f(x)4> k {x)r(x) dx = Yj c „ 


(t>n(x)4> k (x)r(x) dx. 


(12.36) 


But {</>„} is an orthonormal set with respect to r on a < x < b, and so 


m b 

dx 

a 


0 for n # k , 

1 for n — k. 


Therefore every term except the fcth term in the series in the right member of (12.36) is 
zero, and the kth term is simply c k • 1 = c k . Thus (12.36) reduces to 

f f(x)<p k (x)r(x) dx = c k . 


This is a formula for the fcth coefficient (k = 1, 2, 3, . . .) in the assumed series expansion 

(12.35) . Thus under suitable convergence conditions the coefficients c n in the expansion 

(12.35) are given by the formula 


c 


n 


f f(x)4>„(x)r(x) dx 


a 


(«= 1,2, 3,...). 


(12.37) 


oo 

In particular, we note that if the series £ c n 4> n (x) converges uniformly to /(x) on 

n = 1 

a < x < b, then the above formal procedure is justified and we are assured that the 
coefficients c n are given by the formula (12.37). 

Now, given an orthonormal system {</>„} and a function f we can form the series 

1 cAn, (12.38) 

n= 1 

where c n (n = 1, 2, 3, . . .) is given by formula (12.37). But note carefully that this formula 
was obtained under the assumption that / is representable by an expansion of the form 
(12.38). Therefore in general the expansion (12.38) obtained using formula (12.37) is 
indeed a strictly formal expansion. That is, in general we have no assurance that the 
series (12.38) determined by (12.37) converges pointwise on [ a , b]. Further, if this series 
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does converge at a point x on [a, b], we have no assurance that it converges to /(x) at 
this point. To be assured that the formal expansion (12.38) does converge to /(x) at 
every point x of [a, b], the functions / and {</>„} must satisfy certain restrictive 
conditions. It is beyond the scope of this book to attempt to give a detailed discussion 
of the various conditions on / and {</>„} which are sufficient for the convergence of this 
expansion. However, we shall state without proof one basic convergence theorem 
concerned with the case in which the system {</>„} is the system of orthonormal 
characteristic functions of a Sturm-Liouville problem. 

THEOREM 12.3 
Hypothesis 

1. Consider the Sturm-Liouville problem 

f [ P ^ Jx\ + + ^ x ^ 3 ’ = °’ 

A t y(a) + A 2 y'(a) = 0, 

B t y(b) + B 2 y'(b) = 0, 

where p, q , and r are real functions such that p has a continuous derivative , q and r are 
continuous , and p(x) > 0 and r(x) > 0 for all x on the real interval a < x < b. 

Let {X n } (n = 1, 2, 3,...) be the infinite set of characteristic values of this problem, 
arranged in a monotonic increasing sequence A 1 <A 2 <^ 3 <‘“- Let {</>„} (n = 
1 , 2 , 3 ,...) be the corresponding set of orthonormal characteristic functions of the given 
problem. 

2. Let f be a function which is continuous on the interval a < x < b, has a piecewise 
continuous derivative /' on a < x < b, and is such that f(a) = 0 if fifa) = 0 and f(b) = 0 

if = 0. 

Conclusion. The series 


£ C n <t>n , 

n = 1 

where 

c n = j* f(x)4>n(x)r(x) dx (n = 1,2,3,...), 
converges uniformly and absolutely to f on the interval a < x < b. 

► Example 12.13 

x 2 ,0 < x < 7i, in the 
Liouville problem 

(12.5) 

(12.6) 


Obtain the formal expansion of the function f where /(x) = nx — 
series of orthonormal characteristic functions {</>„} of the Sturm- 

d 2 v 

m - o. >'<«> - o. 

Discuss the convergence of this formal expansion. 
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Solution. In Example 12.12 we found that the orthonormal characteristic functions 
{cj) n } of the problem under consideration are given by 

[2 

</>„(x) = — sin nx; 0 < x < n (n = 1, 2, 3, . . .). 

Y n 

We now form the series 


Z C n4>n(x), 


(12.39) 


where for each n = 1, 2, 3, , 


c„ = 


f(x)4>„(x)r(x) dx 


(nx — x 2 )l — sin nx (1) dx 


-i: 


x sin nx dx — x 2 sin nx dx 


[2 f / n . nx \ K fix . 

V^IA" « Jo V» 

-JK-? 


sin mx H — ^ cos mx cos nx 

n* n 


D 


cos nn I — { — o cos nn cos nn t 

1 x n 5 n m 3 


2 2 

= / — * — 3 (1 — cos nn) 


if m is odd, 


n n 

F-A 

' n m J 

[^0 if m is even. 

Thus the series (12.39) becomes 


We write 


^ 8 8 ^ sin(2M — l)x 

> — t sin mx or - > — r — . 

n=i nn 3 7r „ = i (2 m - l) 3 

(nodd) 


, 8 £ sin(2n - l)x 

nx — x ~ 7 z m > U < x < 7i, 


(12.40) 


n n = i (2 n - l) 3 


to denote that the series (12.40) is the formal expansion of / in terms of the 
orthonormal characteristic functions {</>„}. We note that the first few terms of this 
expansion are 

8 Tsinx sin 3x sin 5x T 

k 1 3^ 1 5^ *" J 

Let us now discuss the convergence of this formal expansion. We note first that the 
coefficients p, q , and r in the differential equation (12.5) satisfy Hypothesis 1 of 
Theorem 12.3. Let us now see if the function / satisfies Hypothesis 2. Since / is a 
polynomial function, it certainly satisfies the requirements of continuity and a piece- 
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607 


wise continuous derivative on 0 < x < n. Hypothesis 2 also requires that /( 0) = 0 if 
<^(0) = 0 and f(n) = 0 if <t>i{n) = 0. These requirements are also fulfilled, for 


<M 0) 
<t>M) 


- sin 0 = 0, /(0) = 0, 

\j 71 



o, 


f(n) = n 2 — n 2 = 0. 


Thus all the hypotheses of Theorem 12.3 are satisfied and we conclude that the series 
(12.40) converges uniformly and absolutely to nx — x 2 on the interval 0 < x < n. Thus 
we may write 


nx — x 2 


n „ = i 


sin(2n — l)x 
(2n — l) 3 ’ 


0 < x < n. 


Exercises 


1. Consider the set of functions {</>„}, where 



(t>n+ lM = 



(n= 1,2, 3,...), 


2 . 


on the interval 0 < x < n. Show that this set {<£„} is an orthonormal system with 
respect to the weight function having the constant value 1 on 0 < x < n. 

Obtain the formal expansion of the function / defined by f(x) = x(0 < x < n), in a 
series of orthonormal characteristic functions {</>„} of the Sturm-Liouville 
problem 


d\ 

dx 


2 + ly = 0, 


y( 0) = o, 


y(^) = 0. 


3. Obtain the formal expansion of the function / defined by f(x) = 1(1 < x < e”), in 
a series of orthonormal characteristic functions {</>„} of the Sturm-Liouville 
problem 


d_ 

dx 




y = o, 


y( i) = o, 


y(e n ) = o. 

(See Exercise 7 at the end of Section 12.1.) 

4. Obtain the formal expansion of the function / defined by f(x) = In x, 1 < x < e 2n , 
in a series of orthonormal characteristic functions {</>„} of the Sturm-Liouville 



608 STU RM—LIOU VI LLE BOUNDARY-VALUE PROBLEMS AND FOURIER SERIES 


problem 


d_ 

dx 


dy_ 

dx 


+ -y = o, 

X 


/(l) = 0, 

y\e 2n ) = o, 


where X > 0 (see Example 12.5 of the text). 


12.4 TRIGONOMETRIC FOURIER SERIES 

A. Definition of Trigonometric Fourier Series 

In Section 12.3 we introduced the problem of expanding a function / in a series of the 
form 


I c„4>„, (12.38) 

n= 1 

where {</>„} (n = 1, 2, 3,... ) is an orthonormal system with respect to a weight function r 
on a < x < b. We first assumed that an expansion for / of the form (12.38) does indeed 
exist. Then, assuming suitable convergence, we found that the coefficients c n in (12.38) 
are given by 

C n = J f( x )4>n(x)r{x) dx ( n = 1, 2, 3,...). (12.37) 

We noted that in general this is a strictly formal expansion and that certain restrictions 
must be imposed upon / and {</>„} in order to be assured that the series (12.38) thus 
formed does indeed converge to / on a < x < b. Nevertheless, assuming that the 
functions involved in (12.37) are integrable, we can determine the formal expansion 
(12.38) and give it a name, even though we have no assurance in advance that it actually 
represents the given function / in any sense. This is essentially what we do in the 
following definition. 


DEFINITION 


Let .{0„} (n = 1, 2, 3, . . .), be an orthonormal system with respect to a weight function r on 
a < x < b. Let f be a function such that for each n = 1, 2, 3,..., the product ff> n r is 
integrable on a < x < b. Then the series 


where 


oo 

yi Cn 4*n 9 


n = 1 


(12.38) 


c n = f(x)<t> n (x)r(x) dx (n= 1,2,3,...). 


(12.37) 


is called the Fourier series of f relative to the system {</>„}; the coefficients c n are called the 
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Fourier coefficients of f relative to {</>„}; and we write 

co 

fix) ~ £ c„4> n (x), a < x < b. 


We now proceed to introduce an important special class of Fourier series. For this 
purpose, consider the system of functions {^„} defined by 


ipfx) = 1 , 


I A 2 „M = cos — ( n = 1, 2, 3, . . .), 


(12.41) 


for all x on the interval - L < x < L, where L is a positive constant. Since 
f L mnx nnx . _ 


L 

cos 

L 

mnx 

ein 

nnx 

1 L 

oil! 

L 

mnx 

sin 

mix 


we see that the system (12.41) is orthogonal with respect to the weight function r defined 
by r(x) = 1 on — L < x < L. Now note that 


(l ) 2 dx = 2 L, 


C 0 S 2 ( _ L~/^x = ^ (n = 1, 2, 3,...), 


dx = L (n = 1, 2, 3,...). 


Thus the corresponding orthonormal system is the system of functions {</>„} defined by 

<t> 2 „(x) = - 7 = COS ^ (n = 1, 2, 3, . . .), (12.42) 

V L L 

02 »+iW = -^sin^ (n = 1,2,3,...), 
on the interval — L < x < L. 

Now suppose / is a function such that the product f<j) n is integrable for each function 
cj) n of the system (12.42) on the interval — L < x < L. Then we can form the Fourier 
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series of / relative to the orthonormal system {</>„} defined by (12.42). This is of the form 
(12.38), were the coefficients c n are given by (12.37). Thus using (12.37), with r(x) = 1, we 
find the Fourier coefficients 


c, = 


/(*)</> 


Mix ~: mi- 


C 2n ~ 


s/ 2 L j- 

1 

r* i 

C 2n +l=J f(x)(l)2n + l(x)dx=—^= 


f(x) dx. 


f(x)4> 2 „(x) dx = -J= | f(x) cos ~ dx. 


(12.43) 


f(x) sin ^ dx. 


(n = 1, 2, 3,...). Thus the Fourier series of / relative to the orthonormal system {</>„} 
defined by (12.42) is 


Z C n<t>n, 

n= 1 

where c n (n = 1, 2, 3, . . .) is given by (12.43) and (n = 1, 2, 3, . . .) is, of course, given by 
(12.42). We now write this series in the form 


c 101 + Yj ( C 2n ( t ) 2n + C 2n+ l ( t ) 2n+ l)* 

« = 1 

Substituting c n (n = 1, 2, 3,.. .) given by (12.43) and (n = 1, 2, 3,...) given by (12.42) 
into this series, it takes the form 



f(x) dx 




Rewriting this slightly, it assumes the form 


1 T 1 f L r, \ a 1 . v IT 1 f L \ nnx a 1 nnx 

2\X) /W d*J+ ^ /(*) cos — dx J cos — 


+ 



,, X ‘ H7lX A 

J (x) sin — — dx 



Thus the Fourier series of / relative to system (12.42) may be written as 


1 

2 fl0 + 


00 


z 


( nnx . . nnx\ 

a n cos — + b n sin — 1, 


where 




nnx . 

fix) COS 

J-L 

— — dx 

C L 

nnx 1 

I fix) Sin 

—j— dx 


(w = 0, 1, 2, 3,...), 
(*=1,2, 3,...). 


This special type of Fourier series is singled out in the following definition. 
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DEFINITION 


Let f be a function which is defined on the interval — L < x < L and is such that the 
integrals 


% L 

J -L 


f (x) cos dx 


and 


% L 

J -L 


f (x) sin Y ^~ dx 


(n = 0, 1,2,...) exist. Then the series 


1 ® ( nnx f . nnx 

-a 0 + X y a n cos -y- + b„ sin — 


(12.44) 


where 




f (x) cos dx 

Ld 


,, v . nnx 
J(x) s\n—j— dx 


(n = 0,1,2,...), 
(n = 1, 2, 3,...), 


(12.45) 


is cd/ed t/ie trigonometric Fourier series 0 / / on the interval — L < x < L. We denote 
this by writing 


n = 1 


ri \ 1 / nnx 

f(x) ~ t a 0 + X cos — + b n sin 


nnx 


— L < x < L. 


The numbers a n (n = 0, 1,2,...) and b n (n = 1, 2, 3, . . .) defined by (12.45) will be called 
simply the Fourier coefficients of f. 


Thus to find the trigonometric Fourier series of a function / having the required 
integrability properties on — L < x < L, we determine the Fourier coefficients from 
(12.45) and substitute these coefficients into the series (12.44). There are two important 
special cases in which the determination of the Fourier coefficients is considerably 
simplified. These are the cases in which / is either a so-called even function or a so- 
called odd function. 

A function F is said to be an even function if F( — x) = F(x) for every x. If F is an even 
function, the graph of y = F(x) is symmetric with respect to the y axis and 



F(x) dx = 2 


j F(x)dx 


for every c > 0. A function F is said to be an odd function if F( — x) = — F(x) for every x. 
If F is an odd function, the graph of y = F(x) is symmetric with respect to the origin and 

f F(x) dx = 0 


for every c > 0. 

Now suppose we wish to find the trigonometric Fourier series of an even function f. 
Then each function F n defined by 


Fn(x) = fix) COS — 


(n = 0,1,2,...) 
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is also even, since 


Therefore 


and so 


F n (~x) = f(-x) cos — = f(x) cos = F n (x) 


F n (x) dx = 2 F n (x) dx ( n = 0, 1, 2, . . .). 


nnx , 2 


a ” = Lj /( x > cos -^ ~ dx = i] f(x) cos — dx (n = 0,1,2,.. 

Further, each function G„ defined by 

nnx 

G n (x) = f(x) sin— (n= 1,2,3,...) 

is odd, since 

G„( - x) = /( - x) sin f = -/(x) sin ^ = - G„(x). 


Therefore 


and so 


G n (x) dx = 0 (w = 1, 2, 3,...), 


— J /(x)sin — dx = 0 (w= 1,2,3,...). 

Thus the trigonometric Fourier series of an even function / on the interval 
— L < x < L is given by 


where 


1 £ nnx 

x flo + L cos — , 

z „ = i 


= T f(x)cos——dx (n = 0,1,2,...), 


(12.46) 


(12.47) 


In like manner, one finds that the trigonometric Fourier series of an odd function / 
on the interval — L< x < L is given by 


where 


£ , . nnx 

L K sin—, 
1 ^ 


&„ = 7 /(x) sin — — dx (n= 1,2,3,...). 


(12.48) 


(12.49) 


We note that in general the trigonometric Fourier series of a function / is a strictly 
formal expansion, for we have said nothing concerning its convergence. We shall 
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discuss this important matter briefly in Part D of this section. In the meantime, let us 
gain some facility in finding trigonometric Fourier series by considering the following 
examples. 


B. Examples of Trigonometric Fourier Series 
► Example 12.14 

Find the trigonometric Fourier series of the function / defined by 

/(*) = M, — 71 < X < 7T, 

on the interval —n<x<n (see Figure 12.1). 


Solution. In this problem L = n. Since /( — x) = |— x| = |x| = /(x) for all x, the 
function / is an even function. Therefore the trigonometric Fourier series of / on 
“7i < x < 7r is gi ven by (12.46) with L = n, where the coefficients are given by (12.47) 
with L = 7i. Thus, since |x| = x for 0 < x < n, we have 

2 f* 2 f* 

a 0 = - /(x) dx = - x dx = n, 

n Jo Jo 

and 


nx x sin nx 

> 1 


a n = — ( /(x) cos nx dx = — ( x cos nx dx = — 

* Jo n Jo 71 I _ n 

_ 2 [cos mi - ll 2 T(— 1)" — 11 

" 7i L J? J " n I n 1 J 


4 

j, n odd 

nn 2 (n = 1,2,3,...). 

0, n even 



Figure 12.1 




► Example 12.15 

Find the trigonometric Fourier series of the function / defined by 

f(x) = x, — 4 < x < 4, 
on the interval — 4 < x < 4 (see Figure 12.2). 



Figure 1 2.2 


Solution. In this problem L = 4. Since /( — x) = — x = — /(x) for all x, the function 
is an odd function. Therefore the trigonometric Fourier series of / on — 4 < x < 4 is 
given by (12.48) with L = 4, where the coefficients b n are given by (12.49) with L = 4. 
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Thus we have 


b 


n 


2 

4 


*4 

Jo 


/Y . ‘ nUX A 

j(x) sin — ax 


i /M 

1 . nnx 

- x sin — — ax 

2 Jo 4 


ir i6 

2 _n 2 7i 2 


.nnx Ax 

sin — cos 

4 H7T 


ttTtX 

4 _o 


8 


cos nn 

nn 


8(— 1)” _ 8(-l)" + 1 
nn nn 


(n = 1, 2, 3,...). 


Thus the required series is 


8 » (-1)" + 1 . nnx 

- ) sin—— 

n n = 1 n 4 


8 f . nx 1 . nx 1 . 3tcx 

- sin— -sin — + -sin— — 

n 4 2 2 3 4 


and we write 


x 


- z 

n n=l 


(- 1 )' 


n+ 1 


sin 


nnx 


-4 < x < 4. 


► Example 12.16 


Find the trigonometric Fourier series of the function / defined by 

— n < x < 0, 

[x, 0 < x < n, 

on the interval —n<x<n (see Figure 12.3). 


/(*) = 


(12.50) 



Solution. In this problem L = n. The function / defined by (12.50) is neither even 
nor odd. Therefore the trigonometric Fourier series of this function on — n < x < n is 
given by (12.44) with L = n, where the coefficients a n and b n are given by (12.45) with 
L = 7i. We find 




/(x) dx = 


1 


n dx + 



3n 

T ; 
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cos nx dx + 


if" i rr° 

a n = — f{x) cos nx dx = — n 

nj-K n U-n 

1 jr 7i sin nx~\° |~ cos nx x sin nx 1* 

n lL » J-* + L « 2 + « Jo 

1 |~COS «7l — 1”| 

= « |_ n 1 J = 




cos nx dx 


(- 1 )"- 1 


/t odd 


^ (n= 1,2,3,...); 

0, ft even 


sin nx dx + 


If* 1 ff° 

b n = —\ f(x) sin nx dx = — 71 

^ LJ - K 

1 (T ncosnx 0 Tsinnx x cos nx 71 1 

71 lL W j-n + L n Joj 


x sin nx dx 


1 / 71 

7r \ n 


1 


Thus the required series is 
1 


371 ® 

4 + .?, 

3n 


cos nx 


nn 


(n = 1, 2, 3,...). 


1 • 1 
sin nx 

n J 


= — — — cos x + sin x — - sin 2x - ( — cos 3x + - sin 3x - - sin 4x — 


1 


9n 


3' 


1 . 


Now that we have gained some familiarity with the actual details of finding 
trigonometric Fourier series, we call attention to a matter which is of considerable 
importance in this connection. Suppose / is a function defined and integrable on an 
interval a < x < b, and suppose g is a function such that g(x) = f(x) for all except a 
finite number of points of the interval a < x < b. Then it is shown in advanced calculus 
that 

f g(x) dx = f f(x)dx. 


Now suppose / is a function which is defined on the interval — L < x < L and is such 
that the integrals 



/(x) cos 


nnx 

~L 


dx 


and 


-l 


/Y X * n7lX A 

f(x) sin -j— dx 


(n = 0, 1, 2, . ..) exist, and suppose g is a function such that g(x) = f(x) for all except a 
finite number of points of — L < x < L. Then 


dx (n = 0, 1, 2,...). 

dx (n = 1, 2, 3,...). 

The left members of these two identities are the Fourier coefficients of g , and the right 
members are the Fourier coefficients of f Thus / and g have the same Fourier 
coefficients (12.45) and hence the same trigonometric Fourier series (12.44). We thus see 
that if / and g are two functions which have the same values at all except a finite number 


1 

L ( 
1 

L 


L , nnx 1 i L .. 
g(x) cos — dx = — I f(x) c( 

nnx 1 f L . . 
— dX =L 


g{x) sin 


nnx 


nnx 
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of points of the interval — L < x < L, then / and g have the same trigonometric 
Fourier series on — L < x < L, assuming that the Fourier coefficients of either of these 
functions exist. 


► Example 12.17 


In Example 12.16 we found the trigonometric Fourier series of the function / defined 
by 


/(*) = 


\n, —n < x < 0, 

[x, 0 < x < n, 

on the interval —n<x<n (see Figure 12.3 again). The series is 
37 


5tt « f< 

*+&[ 


(-I)’ - I 1 . 

5 — cos nx — sin nx 

nn n 


Now consider the function g defined by 


<?m = 


( 

n, 

— n < x < 0, 

n 

x = 0, 

V 


0 < X < 7T. 


(12.50) 


(12.51) 


(See Figure 12.4 and compare with Figure 12.3.) 

The functions / and g have the same values except at x = 0; for /(x) = g(x) = n, 

— n < x < 0, and / (x) = g(x) = x, 0 < x < n, but /(0) = 0 and ^(0) = tt/ 2. Thus, since 
/ and g have the same values at all except a finite number (in this case, one) of points of 

— ul < x < 7t, they have the same trigonometric Fourier series on this interval. Thus the 
trigonometric Fourier series of the function #on - 7i<x<7cis also the series (12.51). 



Exercises 

For each of the functions / defined in Exercises 1-8 find the trigonometric Fourier 
series of / on — n < x < n. 

1. /(x) = x, —n<x<n. 
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2 . f(x) 

3 - fix) 
4 . fix) 

5 - fix) 

6 - fix) 

7 - fix) 

8 - fix) 


x 2 , —n<x<n. 

J — 2, — 7i < x < 0, 

{ 2, 0 < X < 71. 

fO, - n < x < 0, 

[2, 0 < x < n. 

JO, — 71 < x < 0, 
[x 2 , 0 < x < n. 

0, — n < x < — 7r/2, 

1, —nil < x < 0, 

2, 0 < x < 7 t/2, 

3, 7r/2 < x < n. 

x|x|, —n<x<n. 

e x , —n<x<n. 


For each of the functions / defined in Exercises 9-16 find the trigonometric Fourier 
series of / on the interval stated. 

9. /(x) = |x|, — 6 < x < 6. 

10. /(x) = 2x + 1, — 1 < x < 1. 

fO, — 3 < x < 0, 

[ 1, 0 < x < 3, 

— 2, - 4 < x < 0, 

0, x = 0, 

2, 0 < x < 4, 

(0, — L < x < 0, 


11 . f(x) = 

12. /(x) = 


on the interval - 3 < x < 3. 


on the interval -4 < x < 4. 


13. f(x) = 

14. /(x) = 


L, 0 < x < L. 

0, — L < x < - L/2, 


on the interval — L < x < L. 

on the interval — L < x < L. 


[1, —L/2 < x < L, 

15. /(x) = ax + b, — L < x < L, where a and b are constants. 
1, — L < x < 0. 


16. /(x) = 


L — x, 0 < x < L, 


on the interval — L < x < L. 


C. Fourier Sine Series and Fourier Cosine Series 

In Example 12.12 we noted that the Sturm-Liouville problem 

g-Wy-0, (125) 


y(0) = 0, J'(TT) = 0 


( 12 . 6 ) 



12.4 TRIGONOMETRIC FOURIER SERIES 619 


has the set of orthonormal characteristic functions {ij/ n } defined by 


W*) = 



0 < x < n (n = 1, 2, 3,...). 


If we replace the second condition (12.6) by the more general condition y(L) = 0, where 
L > 0; we obtain the problem 


df 

dx 


2 + Xy = 0, 


Y( 0) = 0, y(L) = 0. 

This problem has the set of orthonormal characteristic functions {</>„} defined by 


2 . nnx 

<t>n(x)= It sin 


L ’ 


0 < x < L (n = 1, 2, 3,...). 


(12.52) 


Let us now consider the problem of expanding an “arbitrary” function / which is 
defined on 0 < x < L in a series of these orthonormal functions {</>„}. In other words, 
given a function / defined on 0 < x < L, let us find the Fourier series of / relative to 
the orthonormal system {</>„} defined by (12.52). From the definition of Fourier series 
we see that the desired series is of the form 


n = 1 

where </>„(«= 1, 2, 3, . . .), is given by (12.52) and 


C» [2 nnx 

Cn = I f(x)(t>„(x)r{x) dx = I f(x) /- sin — dx (n = 1, 2, 3, . . .)• 


Thus the series takes the form 




or 


where 


£ . . nnx 

X b n sin — 

n= 1 L 


nnx 


This special type of Fourier series is identified in the following definition. 


f(x) sin ~j^~dx (n = 1, 2, 3,...). 


DEFINITION 

Let f be a function which is defined on the interval 0 < x < Land is such that the integrals 
L nnx 

f(x) sin — — dx (n = 1, 2, 3,. . .) 

Jo L 
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exist , Then the series 


00 


I 

, = 1 


. . nnx 

K sin — , 


(12.53) 


where 



f(x) sin — — dx 
o L 


(n = 1, 2, 3, . . .), 


(12.54) 


is called the Fourier sine series of f on the interval 0 < x < L.We indicate this by writing 


fix) ~ X b„ sin 


nnx 


0 < x < L. 


We thus see that a function / having the required integrability properties on 
0 < x < L can be formally expanded into a sine series of the form (12.53) with 
coefficients given by (12.54). We observe that this expansion is identical with the 
trigonometric Fourier series (12.48) of the odd function which is defined on 
— L< x < L and which coincides with / on 0 < x < L. 

Now suppose that we wish to expand a function / defined on 0 < x < L into a series 
of cosines instead of a series of sines. If we consider the Sturm-Liouville problem 


d 2 y 

dx 2 


+ — 0 , 


y'( 0) = 0, y\L) = 0 (where L > 0), 


we find that the set of orthonormal characteristic functions of this problem is the set 
{(/>„} defined by 


<t>n+ iM = 


01 M 




0 < x < L 


(n= 1,2, 3,...). 


Using the definition of Fourier series one finds that the Fourier series of a function / 
relative to this orthonormal system is the series identified in the following definition. 


DEFINITION 


Let fbea function which is defined on the interval 0 < x < L and is such that the integrals 


f(x) cos—— dx 
0 L 


(n = 0,1,2,...) 


exist. Then the series 


1 ® nnx 

+ L a* cos— , 

Z n = 1 L, 


(12.55) 


where 

* i 

nnx 

f(x) cos — dx (n = 0, 1, 2, . . .), (12.56) 

o ^ 
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is called the Fourier cosine series of f on the interval 0 < x < L. We indicate this by 
writing 

1 00 mix 

f(x) ~ X a 0 + Z a « cos ~T ’ 0< x< L. 

L n = i L 

Note that this expansion is identical with the trigonometric Fourier series (12.46) of 
the even function which is defined on — L < x < L and which coincides with / on 
0 < x < L. 

► Example 12.18 

Consider the function / defined by 

f(x) = 2x, 0 < x < 71. 

(See Figure 12.5.) Find (1) the Fourier sine series of / on 0 < x < tc, and (2) the Fourier 
cosine series of / on 0 < x < n. 

Solution 

1. In this problem L = n. The Fourier sine series of / onO < x < n is given by (12.53) 
with L = n, where the coefficients b n are given by (12.54) with L = n. We find 


2 C K 2 C n 

b n = — f(x) sin nx dx = — 2x sin nx dx 
71 Jo 71 Jo 



Figure 12.5 
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Thus the Fourier sine series of / on 0 < x < n is 

„ s (-i)" +1 . T . l . „ l . , 1 

4 > sin nx = 4 sin x — - sin 2x + - sin 3x — • • • . 

n = i n L 2 3 J 

This expansion is identical with the trigonometric Fourier series (12.48) of the odd 
function g defined by 


g(x) = 2x, — n < x < n, 

on the interval —n<x<n (see Figure 12.6). 

2. The Fourier cosine series of / on 0 < x < n is given by (12.55) with L = n, where 
the coefficients a n are given by (12.56) with L = n. We find 


2 f* 

tfo = “ 

* Jo 

2 r* 

a n=~ \ 

n Jo 


/(x) dx = ■ 


/(x) cos nx dx 


2x dx = 2n: 


_ 2 r* 

~ ^ Jo 


2x cos nx dx 


4 cos mx x sin nx 71 4 / 

= - = 1 = j (COS M7C — 1) 

n m z m o 71/1 


un 


[(-!)"-!] = 


71M 


2 > 


m odd 


0, m even 


(m = 1, 2, 3, . . .). Thus the Fourier cosine series of / on 0 < x < n is 
8 ^2, COS MX 8 ® cos(2m - l)x 

U ~n n h ~^~ = 7l ~n n h (2H — l) 2 

(n odd) 


__ 8 CO 5 

n _ 1 


cos x cos 3x cos 5x 


+ 




Figure 12.6 
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623 



This expansion is identical with the trigonometric Fourier series (12.46) of the even 
function h defined by 

h(x) = |2x|, — n < x < n, 

on the interval —n < x < n (see Figure 12.7). 


Exercises 


For each of the functions / defined in Exercises 1-6 find (a) the Fourier sine series of / 
on 0 < x < n 9 and (b) the Fourier cosine series of / on 0 < x < n. 


1- f(x) 

2 . m 

3. f(x) 

4. fix) 

5- fix) 
6. fix) 


1, 0 < x < it. 

X , 0 < X < 71. 

fO, 0 < x < 7t/2, 

(2, tc/2 < x < n. 

{ x, 0 < x < ti/2, 

(7C — X, Till < X < It. 

sin x, 0 < x < 7i. 
e x , 0 < x < 7i. 


For each of the functions / defined in Exercises 7-12 find (a) the Fourier sine series of / 
on 0 < x < L, and (b) the Fourier cosine series of / on 0 < x < L. 


7. /(x) = 2L, 0 < x < L. 

8. /(x) = 4x, 0 < x < L. 


9. 


f(x) = 



0 < x < L/2, 
L/2 < x < L. 
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10 . 

11 . 

12 . 


/(*) = 


10 ’ 

L — x 
10 ’ 


f(x) = Lx- x 2 . 


0 < x < L/2, 

L/2 < x < L. 
0 < x < L. 


/(*) = 


X 2 , 


9 L 2 
~4~ 


n 3 L 

0^x< T , 


9 Lx 3 L 
~ 4 ~’ ~4 


< x < L. 


D. Convergence of Trigonometric Fourier Series 

In Part A of this section we defined the trigonometric Fourier series of a function / on 
the interval — L < x < L. According to our definition one finds the trigonometric 
Fourier series of / by first determining certain constants (the Fourier coefficients of /) 
from the formulas 



f(x) cos ^^-dx 

Lj 


,, v . nnx 
f(x) sin — — ax 

Lj 


(« = 0 , 1 , 2 ,...), 

(» = 1,2, 3,...), 


(12.45) 


and then substituting these constants into the series 


1 

2 «° + 


00 

1 

n = 1 


( /i7rx , . nnx\ 

a„ cos— + b n sin — 1. 


(12.44) 


We have already pointed out that this development is a formal expansion since no 
mention of convergence is made. Thus the following equations arise concerning the 
trigonometric Fourier series (12.44) of the function / 


1. Is the trigonometric Fourier series of / convergent for some or all values of x in the 
interval — L < x < L? 

2. If the trigonometric Fourier series of / converges at a certain point x of this 
interval, does it converge to / (x)? 

In this section we shall state a theorem which will enable us to answer these questions 
in a manner which is satisfactory for our purposes. 

As a first step toward stating this theorem, let us consider a general trigonometric 
series of the form (12.44). We shall say that a function F is periodic of period P if there 
exists a constant P > 0 such that F(x + P) = F(x) for every x for which F is defined. In 
particular, the functions defined by sin(nnx/L) and cos(nnx/L) are periodic of period 
2L/n. Now observe that if F is periodic of period P, then F is also periodic of period nP , 
where n is a positive integer. Hence for every positive integer n, the functions defined by 
sin(n7Ex/L) and cos(nnx/L) are also periodic of period 2 L. We thus see that every term 
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in the series (12.44) is periodic of period 2 L. Therefore if the series converges for all x, 
then the function defined by the series must be periodic of period 2 L. 

In particular, suppose that the series (12.44) is the trigonometric Fourier series of a 
function / defined on — L < x < L and assume that this series converges for every x 
in this interval. Then this series will also converge for every value of x( — oo < x < oo) 
and the function so defined will be periodic of period 2 L. This suggests that a trigono- 
metric Fourier series of the form (12.44) might possibly be employed to represent a 
periodic function of period 2 L for all values of x. Such a representation is indeed pos- 
sible for a periodic function which satisfies suitable additional conditions. The con- 
vergence theorem which we shall state gives one set of such conditions. In order to 
state this theorem as simply as possible, we introduce the following definition. 


DEFINITION 

A function f is said to be piecewise smooth (or sectionally smooth) on a finite interval 
a < x < bif this interval can be divided into a finite number of subintervals such that (1) f 
has a continuous derivative f in the interior of each of these subintervals , and (2) both 
f(x) and f'(x) approach finite limits as x approaches either endpoint of each of these 
subintervals from its interior. In other words , we may say that f is piecewise smooth on 
a < x < bif both f and f are piecewise continuous on a < x < b. 


► Example 12.19 


The function / defined on the interval 0 < x < 5 by 

x 2 , 0 < x < 
2 — x, 1 < x < 
1, 3 < x < 
x - 4, 4 < x < 


fix) = 


1 , 

3, 

4, 

5, 


is piecewise smooth on this interval (see Figure 12.8). 

We now state the convergence theorem for trigonometric Fourier series on the 
interval — L < x < L. 



Figure 12.8 
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THEOREM 12.4 


Hypothesis. Let f be a function such that 

1. f is periodic of period 2 L, and 

2. f is piecewise smooth on the interval — L < x < L. 


Conclusion. The trigonometric Fourier series of f 


1 


nnx 


nnx 


where 


2 a o + Z I ' «» cos — + b„ sin — 


1 C L nux 

= — I /(x)cos — dx (n = 0, 1,2,...), 


= 


1 


nnx 


f(x) sin — dx (n = 1, 2, 3, . . .), 


converges at every point x to the value 

/(* + ) + /(*-) 


(12.44) 


(12.45) 


(12.57) 


w/zere /(x + ) is t/ie right-hand limit of f at x and /(x— ) is the left-hand limit of f at x. 

In particular , if f is also continuous at x, file ua/we (12.57) reduces to f(x) and the 
trigonometric Fourier series of f at x converges to /(x). 


We observe that the value (12.57) is the arithmetic mean (“average”) of the right- 
hand and left-hand limits of / at x. ' 


► Example 12.20 


Consider the function / defined for x in the interval — n < x < n by 

/M = 


n , — n < x < 0 , 

x, 0 < x < ft. 


and for all other x by the periodicity condition 

/(x + 2 n) = /(x) for all x. 


(12.58) 

(12.59) 


(See Figure 12.9.) 

The condition /(x + 2n) = /(x) for all x states that / is periodic of period 2n. 
Further, the function / is piecewise smooth on the interval —n<x<n. Thus the 
hypothesis of Theorem 12.4 is satisfied by this function / (with L = n). 

On the interval —n<x<n the function / is identical with the function / of 
Example 12.16. Thus the trigonometric Fourier series (12.44) of / is the series 


371 



00 


z 

, = 1 


(- I )’- 1 
2 

rr vi 


1 . 

cos nx — sin nx 
n 


which was determined in Example 12.16. 


(12.51) 
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The conclusion of Theorem 12.4 assures us that the series (12.51) converges to 


/(*+) + /(*-) 
2 


(12.57) 


at every point x, — oo < x < oo. Further, since / is continuous at every x except x = 
± 2nn (n = 0, 1 , 2, . . . ), the series ( 1 2.5 1 ) converges to / (x) at every x except x = + 2nn 
(i n = 0, 1, 2,...). For example, at x = n, the series (12.51) is 


In » 1 -(-!)" 3 n 2 » 1 

4 + ^ nn 2 4 + n „ = i (2n - l) 2 


3n 

= T + 


2 n j. 

n _1 2 + 3 ; 


1 

2 + J2 + 


and this converges to f(n) = n. Thus we may write 

3?r 2 T 1 1 1 

* = T + ^[t I + 3 I + 5 I + 

from which we obtain the interesting result 
- 2 1 1 1 


n‘ _ 1 1 I 1 

Y=l2 + 32 + 52 +-+( 2n _l) 2 


(12.60) 


At each point x = ±2^71 (n = 0, 1, 2, . . .), at which / is discontinuous, we find that 
/(x + ) = /(0 + ) = 0, f(x — ) = /( 0 — ) = 71, and hence 

/(* + ) + /(x-) n 

2 2 ' 


Thus at x = ±2nn ( n = 0, 1,2,...), the series (12.51) converges to the value tc/ 2. For 
example, at x = 0, the series is 

3tc « (-ir- 1 _ 371 2 ” 1 

4 + n =i nn 2 4 n (2 n — l) 2 

and this converges to tc/2. Thus we may write 

n 3n 2 ™ 1 

2 4 7E „=i (2/t — l) 2 ’ 

and this again leads to the result (12.60). 
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In summary, we see that the trigonometric Fourier series (12.51) of the function / 
defined by (12.58) and (12.59) converges for all x to the function g defined by 


g(*) = 


— 71 < X < 0, 

x = 0, 


(12.61) 


[xj 0 < x < n, 
g(x + 2n) = g(x) for all x. 

(See Figure 12.10 and compare with Figure 12.9.) 

Concerning the convergence of Fourier sine and Fourier cosine series on the interval 
0 < x < L, we state the following theorem. 


THEOREM 12.5 


Hypothesis. Let f be a function which is piecewise smooth on the interval 0 < x < L. 

Conclusion 

7. The Fourier sine series of f 

00 Y1TTY 

X b„ sin——, (12.53) 

n= 1 Li 

where 

^ n = z| f(x)sin~dx (n = 1,2,3,...), (12.54) 

converges to the value 


f(x + ) + fix-) 
2 


for every x such that 0 < x < L. 

In particular , if f is also continuous at x, 0 < x < L, the Fourier sine series of f at x 
converges to /(x). 

The Fourier sine series of f converges to zero at x = 0 and x = L. 
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The Fourier sine series of f converges at every point x to the value 

g(x + ) + g(x-) 

2 

where g is the odd , periodic function of period 2 L which coincides with f in the interval 
0 < x < L and is such that g(0) = g(L) = 0. 

2. The Fourier cosine series of f 

~a 0 + £ a n cos — , (12.55) 

Z n= 1 L> 

where 

2 

a -= i 

converges to the value 

f(x + ) + /(*-) 

2 


f(x) cos dx 
o L, 


(n = 0, 1,2,...). 


(12.56) 


/or every x such that 0 < x < L. 

In particular , if f is also continuous at x, 0 < x < L, the Fourier cosine series of f at x 
converges to /(x). 

The Fourier cosine series of f converges to /(0 + ) at x = 0 and to f(L-) at x = L. 
The Fourier cosine series of f converges at every point x to the value 

fe(x + ) + h(x ) 

2 ? 

where h is the even , periodic function of period 2 L which coincides with f in the interval 
0 <x< L. 


► Example 12.21 


Consider the function / defined by 

f(x) = x, 0 < x < n. 

The Fourier sine and cosine series of / on 0 < x < n are readily found to be 

(-1)" + 1 sin nx 


and 


21 
n = 1 


u 4 ^ cos(2n — l)x 
2~n n = i (2n — l) 2 ’ 


(12.62) 


(12.63) 


respectively. 

The function / is piecewise smooth on 0 < x < n; in fact, it is continuous on this 
interval. Therefore, applying Theorem 12.5, we see that the Fourier sine series (12.62) 
and the Fourier cosine series (12.63) of / both converge to /(x) at every x such that 
0 < x < n. Further, the sine series (12.62) converges to zero at x = 0 and x = n\ and 
the cosine series (12.63) converges to /(0 + ) = 0 at x = 0 and to f(n — ) = n at x = n. 
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The sine series (12.62) converges at every point x to the value 

g(x+) + g(x-) 

2 

where g is the odd, periodic function of period 2n which coincides with / in the interval 
0 < x < n and is such that g(0) = g(n) = 0. This function g is defined by 

g(x) = x, 0 < x < n, 

g(x) = 0, x = 7i, 

g( — x) = — g(x) for all x, (12.64) 

g(x + 2n) = g(x) for all x. 

Its graph is shown in Figure 12.1 1. For this function g we see that 

0(x + ) + 0(x-) 

2 = g ' X * 

for every x. Thus the sine series ( 1 2.62) converges at every point x to the value g(x) given 
by (12.64). 

The cosine series (12.63) converges at every point x to the value 

fc(x + ) + fc(*-) 

2 


where h is the even, periodic function of period 2n which coincides with / in the interval 
0 < x < 7i. This function h is defined by 

h(x) = X, 0 < X < 7T, 

h( — x) = h(x) for all x, (12.65) 

h(x 4- 2n) = h(x) for all x. 

Its graph is shown in Figure 12.12. For this function h we see that 

, Hx) 

for every x. Thus the cosine series (12.63) converges at every point x to the value h(x) 
given by (12.65). 



Figure 12.11 
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Exercises 


1. Consider the function / defined for x on the interval — n < x < n by 

0, — n < x < 0, 

/(x) = < x + 1, 0 < X < 7l/2, 

2x, n/2 < x < 7i, 

and for all other x by the periodicity condition 

/(x + 2n) = f(x) for all x. 


Discuss the convergence of the trigonometric Fourier series of / In particular, 
determine the value to which the series converges at each of the points x = 0, 
x = n/2, and x = n. 

2. Consider the function / defined on the interval — n < x < n by 

— n < x < 0, 

0 < x < 7 c/3, 

7i/3 < x < 27 t/3, 

27t/3 < X <71, 

and for all other x by the periodicity condition 

/(x + 2n) = f{x) for all x. 

Discuss the convergence of the trigonometric Fourier series of f Determine those 
values of x in — n < x < n at which this series does not converge to /(x) and 
determine the value to which the series does converge at each of these points. 


/(*) = 


(x + n) 2 , 

n 2 , 

8 , 

5, 


= CHAPTER THIRTEEN 

Nonlinear Differential Equations 


The mathematical formulation of numerous physical problems results in differential 
equations which are actually nonlinear. In many cases it is possible to replace such a 
nonlinear equation by a related linear equation which approximates the actual 
nonlinear equation closely enough to give useful results. However, such a “lineariza- 
tion” is not always feasible; and when it is not, the original nonlinear equation itself 
must be considered. While the general theory and methods of linear equations are 
highly developed, very little of a general character is known about nonlinear equations. 
The study of nonlinear equations is generally confined to a variety of rather special 
cases, and one must resort to various methods of approximation. In this chapter we 
shall give a brief introduction to certain of these methods. 


13.1 PHASE PLANE, PATHS, AND CRITICAL POINTS 


A. Basic Concepts and Definitions 


In this chapter we shall be concerned with second-order nonlinear differential 
equations of the form 


d 2 x 
It T 



(13.1) 


As a specific example of such an equation we list the important van der Pol equation 


d 2 x 

dt 2 


+ n(x 2 — 1)— + x = 0, 

at 


(13.2) 


where n is a positive constant. We shall consider this equation at a later stage of our 
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study. For the time being, we merely note that ( 1 3.2) may be put in the form (13.1), where 


F 





Let us suppose that the differential equation (13.1) describes a certain dynamical 
system having one degree of freedom. The state of this system at time t is determined by 
the values of x (position) and dx/dt (velocity). The plane of the variables x and dx/dt is 
called a phase plane. 

If we let y = dx/dt , we can replace the second-order equation (13.1) by the equivalent 
system 


dx 

It 


y. 




(13.3) 


We can determine information about Equation (13.1) from a study of the system (13.3). 
In particular we shall be interested in the configurations formed by the curves which the 
solutions of (13.3) define. We shall regard t as a parameter so that these curves will 
appear in the xy plane. Since y = dx/dt , this xy plane is simply the x, dx/dt phase plane 
mentioned in the preceding paragraph. 

More generally, we shall consider systems of the form 


dx 

T, ~ P(x ’ A 

dy 

{ = Q^yi 


(13.4) 


where P and Q have continuous first partial derivatives for all (x, y). Such a system, in 
which the independent variable t appears only in the differentials dt of the left members 
and not explicitly in the functions P and Q on the right, is called an autonomous system. 
We shall now proceed to study the configurations formed in the xy phase plane by the 
curves which are defined by the solutions of (13.4). 

From Chapter 10, Section 10.4A, Theorem 10.5, it follows that, given any number t 0 
and any pair (x 0 , y 0 ) of real numbers, there exists a unique solution 

X = f(t), 

M (13.5) 

y = g(t)> 

of the system (13.4) such that 

/(to) = x 0 , 


g(to) = yo- 

If / and g are not both constant functions, then (13.5) defines a curve in the xy plane 
which we shall call a path (or orbit or trajectory) of the system (13.4). 

If the ordered pair of functions defined by (13.5) is a solution of (13.4) and t t is any 
real number, then it is easy to see that the ordered pair of functions defined by 


x = f(t — t t ). 


y = g(t- 1 1 ), 


(13.6) 
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is also a solution of (13.4). Assuming that / and g in (13.5) are not both constant 
functions and that t x # 0, the solutions defined by (13.5) and (13.6) are two different 
solutions of (13.4). However, these two different solutions are simply different 
parametrizations of the same path. We thus observe that the terms solution and path 
are not synonymous. On the one hand, a solution of (13.4) is an ordered pair of functions 
(f g) such that x = f(t), y = g(t) simultaneously satisfy the two equations of the system 
(13.4) identically; on the other hand, a path of (13.4) is a curve in the xy phase plane, 
which may be defined parametrically by more than one solution of (13.4). 

Through any point of the xy phase plane there passes at most one path of (13.4). Let 
C be a path of (13.4) and consider the totality of different solutions of (13.4) which 
define this path C parametrically. For each of these defining solutions, C is traced out 
in the same direction as the parameter t increases. Thus with each path C there is 
associated a definite direction, the direction of increase of the parameter t in the 
various possible parametric representations of C by the corresponding solutions of the 
system. In our figures we shall use arrows to indicate this direction associated with a 
path. 

Eliminating t between the two equations of the system (13.4), we obtain the equation 


dy Qjx, y) 
dx P(x, y) ’ 


(13.7) 


This equation gives the slope of the tangent to the path of (13.4) passing through the 
point (x, y), provided the functions P and Q are not both zero at this point. The one- 
parameter family of solutions of (13.7) thus provides the one-parameter family of paths 
of (13.4). However, the description (13.7) does not indicate the directions associated 
with these paths. 

At a point (x 0 , y 0 ) at which both P and Q are zero, the slope of the tangent to the 
path, as defined by (13.7), is indeterminate. Such points are singled out in the following 
definition. 


DEFINITION 

Given the autonomous system 


a point (x 0 , y 0 ) at which both 

P(x o ,y o ) = 0 and Q(x 0 , y 0 ) = 0 
is called a critical point* of (13.4). 


dx 


dy 

i r my), 


(13.4) 


The terms equilibrium point and singular point are also used. 
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► Example 13.1 

Consider the linear autonomous system 

y> 

(13.8) 

— x. 

Using the methods developed in Chapter 7, we find that the general solution of this 
system may be written 

x = c i sin t — c 2 cos t, 
y = c i cos t + c 2 sin t , 

where c t and c 2 are arbitrary constants. The solution satisfying the conditions x(0) = 0, 
y(0) = 1 is readily found to be 


dx 

li 

dy 

dt 


x = sin t , 
y = cos t. 


(13.9) 


This solution defines a path C Y in the xy plane. The solution satisfying the conditions 
x(0) = - 1, y( 0) = 0 is 


x = sin(r - n/2), 
y = cos (t - n/2). 


(13.10) 


The solution ( 1 3. 10) is different from the solution ( 1 3.9), but (13.10) also defines the same 
path C v That is, the ordered pairs of functions defined by (13.9) and (13.10) are two 
different solutions of (13.8) which are different parametrizations of the same path C i . 
Eliminating t from either (13.9) or (13.10) we obtain the equation x 2 + y 2 = 1 of the 
path C i in the xy phase plane. Thus the path C l is the circle with center at (0, 0) and 
radius 1. From either (13.9) or (13.10) we see that the direction associated with C l is the 
clockwise direction. 

Eliminating t between the equations of the system (13.8) we obtain the differential 
equation 


dy x 

dx y 9 


(13.11) 


which gives the slope of the tangent to the path of (13.8) passing through the point 
(x, y\ provided (x, y) # (0, 0). 

The one-parameter family of solutions 

x 2 + y 2 = c 2 


of Equation (13.11) gives the one-parameter family of paths in the xy phase plane. 
Several of these are shown in Figure 13.1. The path C t referred to above is, of course, 
that for which c = 1. 
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Looking back at the system (13.8), we see that P(x,y) = y and Q(x, y) = -x. 
Therefore the only critical point of the system is the origin (0, 0). Given any real number 
t 0 , the solution x = f(t ), y = g(t) such that f(t 0 ) = g{t 0 ) = 0 is simply 

x = 0, 

~ for all t. 

y = o, 

We can also interpret the autonomous system (13.4) as defining a velocity vector field 
V , where 

v(x> y ) = lP(x> y\ 6(*> jO]* 

The x component of this velocity vector at a point (x, y) is given by P(x, y), and the 
y component there is given by g(x, y). This vector is the velocity vector of a 
representative point R describing a path of (13.4) defined parametrically by a solution 
x = f{t\ y = g{t). At a critical point both components of this velocity vector are zero, 
and hence at a critical point the point R is at rest. 

In particular, let us consider the special case (13.3), which arises from a dynamical 
system described by the differential equation (13.1). At a critical point of (13.3) both 
dx/dt and dy/dt are zero. Since 

dy d 2 x 
dt dt 2 ’ 

we thus see that at such a point the velocity and acceleration of the dynamical system 
described by (13.1) are both zero. Thus the critical points of (13.3) are equilibrium 
points of the dynamical system described by (13.1). 

We now introduce certain basic concepts dealing with critical points and paths. 
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DEFINITION 

A critical point (x 0 , y 0 ) of the system (13.4) is called isolated if there exists a circle 

(x - x 0 ) 2 + (y- y 0 ) 2 = r 2 

about the point (x 0 , y 0 ) such that (x 0 , y 0 ) * 5 the only critical point of (13.4) within this 
circle. 

In what follows we shall assume that every critical point is isolated. 

Note. For convenience, we shall take the critical point (x 0 , y 0 ) to be the origin (0, 0). 
There is no loss in generality in doing this, for if (x 0 , y 0 ) / (0, 0), then the translation of 
coordinates £ = x — x 0 ,rj = y — y 0 transforms (x 0 , y 0 ) into the origin in the £r\ plane. 


DEFINITION 

Let Cbea path of the system (13.4), and letx = f (t), y = g(t) be a solution of (13.4) which 
represents C parametrically . Let (0, 0) be a critical point of (13.4). We shall say that the 
path C approaches the critical point (0, 0) as t -+ + oo if 

lim f(t) = 0, lim g(t) = 0. (13.12) 

t — ► + OO t~* + OO 

Thus when we say that a path C defined parametrically by x = f(t), y = g(t) 
approaches the critical point (0, 0) as t -+ + oo, we understand the following: a point R 
tracing out C according to the equations x = f(t), y = g(t) will approach the point 
(0, 0) as t-+ +oo. This approach of a path C to the critical point (0, 0) is independent of 
the solution actually used to represent C. That is, if C approaches (0, 0) as t + oo, 
then (13.12) is true for all solutions x = f(t), y = g(t) representing C. 

In like manner, a path approaches the critical point (0,0) as t -+ — oo if 

lim fft) = 0, lim gft) = 0, 

t-> — OO t~* - 00 

where x = fft), y = gft) is a solution defining the path C v 


DEFINITION 


Let Cbea path of the system (13.4) which approaches the critical point (0, 0) of (13.4) as 
£-►+ oo, and let x = f(t), y = g(t) be a solution of (13.4) which represents C 
parametrically. We say that C enters the critical point (0,0) as t + oo if 


lim 

t-* + oo 


qtt) 

fit) 


(13.13) 


exists or if the quotient in ( 1 3 . 1 3). becomes either positively or negatively infinite as t — > 

+ OO. 


We observe that the quotient g(t)/f(t) in (13.13) represents the slope of the line 
joining critical point (0,0) and a point R with coordinates [/(£), g(t )] on C. Thus when 
we say that a path C enters the critical point (0,0) as t -► + oo we mean that the line 
joining (0, 0) and a point R tracing out C approaches a definite “limiting” direction as 
t — ► + 00. 
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B. Types of Critical Points 

We shall now discuss certain types of critical points which we shall encounter. We shall 
first give a geometric description of each type of critical point, referring to an 
appropriate figure as we do so; we shall then follow each such description with a more 
precise definition. 

(1) The critical point (0,0) of Figure 13.2 is called a center. Such a point is 
surrounded by an infinite family of closed paths, members of which are arbitrarily close 
to (0, 0), but it is not approached by any path either as t -+ + oo or as t — oo. 

More precisely, we state: 


DEFINITION 

The isolated critical point (0, 0) of (13.4) is called a center if there exists a neighborhood 
of (0, 0) which contains a countably infinite number of closed paths P n (n = 1, 2, 3,...), 
each of which contains (0, 0) in its interior, and which are such that the diameters of the 
paths approach 0 as n -+ oo [ but (0, 0) is not approached by any path either ast -+ + oo or 
as t-+ — oo]. 

Note . The preceding definition contains several terms which may be unfamiliar to 
some readers. Moreover, an explanatory statement concerning the infinite number of 
closed paths should be made. We proceed to discuss these matters. 

1. We define a neighborhood of (0, 0) to be the set of all points (x, y) lying within some 
fixed (positive) distance d of (0, 0). 



Figure 13.2 
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2. An infinite set is called countable if it can be put into one-to-one correspondence 
with the set of all positive integers. An example of a countable set is the set of all 
rational numbers. 

3. An infinite set is uncountable if it is not countable. An example of an uncountable 
set is the set of all real numbers. 

4. It can be shown that an uncountably infinite set always contains a countably 
infinite subset. Thus the definition of center does not exclude the possibility of an 
uncountably infinite set of closed paths each of which contains (0, 0) in its interior. 
Indeed, in the case of a linear system and all of the nonlinear systems which we 
shall encounter in this text, there always exists such an uncountably infinite set of 
closed paths containing the center in their interior. 

5. We call the diameter of a closed curve C the maximum of the distance d(A , B) 
between points A and B on C for all possible pairs of points A and B on C. 

(2) The critical point (0, 0) of Figure 1 3.3 is called a saddle point. Such a point may be 

characterized as follows: 

1. It is approached and entered by two half-line paths {AO and BO) as t -* + oo, these 
two paths forming the geometric curve AB. 

2. It is approached and entered by two half-line paths (CO and DO) as t -+ - oo, these 
two paths forming the geometric curve CD. 

3. Between the four half-line paths described in (1) and (2) there are four domains R l9 
R 2 , R 3 , R 4 , each containing an infinite family of semi-hyperbolic-like paths which 
do not approach 0 as t -+ + oo or as t -* — oo, but which become asymptotic to 
one or another of the four half-line paths as t -* -f oo and as t -* — oo. 



Figure 13.3 
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More precisely, we state: 

DEFINITION 

The isolated critical point (0, 0) of {13.4) is called a saddle point if there exists a 
neighborhood of (0, 0) in which the following two conditions hold: 

1. There exist two paths which approach and enter (0,0) from a pair of opposite 
directions ast^> + oo, and there exist two paths which approach and enter (0,0) from a 
different pair of opposite directions as t -> — oo. 

2. In each of the four domains between any two of the four directions in (7) there are 
infinitely many paths which are arbitrarily close to (0, 0) but which do not approach (0, 0) 
either as t + oo or as t — oo. 


(3) The critical point (0, 0) of Figure 1 3.4 is called a spiral point (or focal point). Such 
a point is approached in a spiral-like manner by an infinite family of paths as t -> + oo 
(or as t -* — oo ). Observe that while the paths approach O, they do not enter it. That is, a 
point R tracing such a path C approaches O as t -+ + oo (or as t — oo), but the line OR 
does not tend to a definite direction, since the path constantly winds about O. 

More precisely, we state: 

DEFINITION 

The isolated critical point (0, 0) of {13.4) is called a spiral point {or focal point) if there 
exists a neighborhood of (0, 0) such that every path P in this neighborhood has the 
following properties: 

1. P is defined for all t > t 0 {or for all t < t 0 ) for some number t 0 ; 



Figure 13.4 
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2. P approaches (0, 0) as t -+ -f oo (or as t -+ — oo); and 

3. P approaches (0, 0) in a spiral-like manner , winding around (0, 0) an infinite number 
of times as t -* + oo (or as t ^ — oo). 

(4) The critical point (0, 0) of Figure 13.5 is called a node . Such a point is not only 
approached but also entered by an infinite family of paths as t-+ + oo (or as t -+ — oo). 
That is, a point R tracing such a path not only approaches O but does so in such a way 
that the line OR tends to a definite direction as t + oo (or as t-+ — oo). For the node 
shown in Figure 13.5 there are four rectilinear paths, AO , BO, CO, and DO. All other 
paths are like “semiparabolas.” As each of these semiparabolic-like paths approaches 
O, its slope approaches that of the line AB. 

More precisely, we state: 


DEFINITION 

The isolated critical point (0, 0) of (13.4) is called a node if there exists a neighborhood of 
(0, 0) such that every path P in this neighborhood has the following properties: 

1. P is defined for all t > t 0 [or for all t < t 0 ] for some number t 0 \ 

2. P approaches (0, 0) as t -+ + oo [or as t ^ — oo]; and 

3. P enters (0, 0) as t -+ + oo [or as t -+ — oo]. 


C. Stability 


We assume that (0, 0) is an isolated critical point of the system 

Sr-*** 

sr e( *- A 


(13.4) 



B 


Figure 1 3.5 
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and proceed to introduce the concepts of stability and asymptotic stability for this 
critical point. 


DEFINITION 

Assume that (0, 0) is an isolated critical point of the system (13.4). Let C be a path of 
(13.4); let x = f(t), y = g(t) be a solution of (13.4) defining C parametrically. Let 

m = y/UW 2 + [ 0 (f )] 2 (i3.i4) 

denote the distance between the critical point (0, 0) and the point R: \_f(t\ g(t)] on C. The 
critical point (0, 0) is called stable if for every number e > 0, there exists a number S > 0 
such that the following is true: Every path C for which 

D(t 0 ) < S for some value t 0 (13.15) 

is defined for all t > t 0 and is such that 

D(t) <e for t 0 < t < 00 . (13.16) 

Let us try to analyze this definition, making use of Figure 13.6 as we do so. The 
critical point (0, 0) is said to be stable if, corresponding to every positive number e, we 
can find another positive number S which does “something” for us. Now what is this 
“something”? To answer this we must understand what the inequalities in (13.15) and 
(13.16) mean. According to ( 1 3. 1 4), the inequality D(t 0 ) < d for some value r 0 in (13.15) 
means that the distance between the critical point (0, 0) and the point R on the patch C 
must be less than d at t = t 0 . This means that at t = t 0 , JR lies within the circle K t of 
radius S about (0, 0) (see Figure 13.6). Likewise the inequality D(t) < e for t 0 < t < oo 
in (13.16) means that the distance between (0, 0) and R is less than e for all t > t 0 , and 
hence that for t > t 0 , R lies within the circle K 2 of radius e about (0. 0). Now we can 
understand the “something” which the number <5 does for us. If (0, 0) is stable, then 
every path C which is inside the circle of radius d at t = t 0 will remain inside the 



Figure 1 3.6 
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circle K 2 of radius t > t 0 . Roughly speaking, if every path C stays as close to (0, 0) as we 
want it to (that is, within distance c) after it once gets close enough (that is, within 
distance <5), then (0, 0) is stable. 

DEFINITION 

Assume that (0, 0) is an isolated critical point of the system {13.4). Let C be a path of 
( 13.4); and letx = f ( t ), y = g(t) be a solution of {13.4) representing C parametrically. Let 

D(t) = V[/(t )] 2 + [0(t)] 2 (13-14) 

denote the distance between the critical point (0, 0) and the point R: [/(t ), g{t)~\ on C. 

The critical point (0, 0) is called asymptotically stable if {1) it is stable and (2) there 
exists a number d 0 > 0 such that if 

D{t 0 ) < d 0 (13.17) 

for some value t 0 , then 

lim f{t) = 0, lim g{t) = 0. (13.18) 

t~* + 00 t~* + 00 

To analyze this definition, note that condition (1) requires that (0, 0) must be stable. 
That is, every path C will stay as close to (0, 0) as we desire after it once gets sufficiently 
close. But asymptotic stability is a stronger condition than mere stability. For, in 
addition to stability, the condition (2) requires that every path that gets sufficiently 
close to (0, 0) [see (13.17)] ultimately approaches (0, 0) as t -* + oo [see (13.18)]. Note 
that the path C of Figure 13.6 has this property. 

DEFINITION 

A critical point is called unstable if it is not stable. 

As illustrations of stable critical points, we point out that the center in Figure 13.2, 
the spiral point in Figure 13.4, and the node in Figure 13.5 are all stable. Of these three, 
the spiral point and the node are asymptotically stable. If the directions of the paths in 
Figures 13.4 and 13.5 has been reversed, then the spiral point and the node of these 
respective figures would have both been unstable. The saddle point of Figure 13.3 is 
unstable. 


Exercises 


For each of the autonomous systems in Exercises 1 and 2, (a) find the real critical points 
of the system, (b) obtain the differential equation which gives the slope of the tangent to 
the paths of the system, and (c) solve this differential equation to obtain the one- 
parameter family of paths. 


4 dx , 

~ dx 


1 *-* 

dy 2 

jr x ~ y - 

£ = sin x. 
d t 
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3. Consider the linear autonomous system 


dx 

It 


x, 


dy 

dt 


= x + y. 


(a) Find the solution of this system which satisfies the conditions x(0) = 1, 

y( o) = 3. 

(b) Find the solution of this system which satisfies the conditions x(4) = e , 
y( 4) = 4e. 

(c) Show that the two different solutions found in (a) and (b) both represent the 
same path. 

(d) Find the differential equation which gives the slope of the tangent to the 
paths, and solve it to obtain the one-parameter family of paths. 

4. Consider the linear autonomous systems 


(a) 



dy 

dt 


= -2 y. 


(b) 



dy 

dt 


-x. 


Show that the critical point (0, 0) of system (a) is asymptotically stable, but that the 
critical point (0, 0) of system (b) is stable but not asymptotically stable. 


13.2 CRITICAL POINTS AND PATHS OF LINEAR SYSTEMS 
A. Basic Theorems 


Our main purpose in this chapter is the study of nonlinear differential equations and 
the corresponding nonlinear autonomous systems of the form (13.4). We shall be 
interested in classifying the critical points of such nonlinear systems. In Section 13.3 we 
shall see that under appropriate circumstances we may replace a given nonlinear 
system by a related linear system and then employ this linear system to determine the 
nature of the critical point of the given system. Thus in this section we shall first 
investigate the critical points of a linear autonomous system. 

We consider the linear system 


dx 

— = ax + 
dt 


by, 


dy 

— = cx + dy, 
dt 


(13.19) 


where a, b, c, and d (in the right member of the second equation) are real constants. The 
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origin (0, 0) is clearly a critical point of (13.19). We assume that 

bl # 0 , 


(13.20) 


and hence (0, 0) is the only critical point of (13.19). 

Recall that in Chapter 7 we sought and found solutions of (13.19) of the form 


x = Ae x \ 
y = Be Xt . 


(13.21) 


We saw there that if (13.21) is to be a solution of (13.19), then X must satisfy the 
quadratic equation 

X 1 - (a + d)X + {ad - be) = 0, (13.22) 

called the characteristic equation of (13.19). Note that by condition (13.20) zero cannot 
be a root of the equation (13.22) in the problem under discussion. 

Let X t and X 2 be the roots of the characteristic equation (13.22). We shall prove that 
the nature of the critical point (0, 0) of the system (13.19) depends upon the nature of 
the roots X i and X 2 . We would expect three possibilities, according as X t and X 2 are real 
and distinct, real and equal, or conjugate complex. But actually the situation here is not 
quite so simple and we must consider the following five cases: 

1. X { and X 2 are real, unequal, and of the same sign. 

2. ‘ X i and X 2 are real, unequal, and of opposite sign. 

3. X Y and X 2 are real and equal. 

4. X i and X 2 are conjugate complex but not pure imaginary. 

5. X { and X 2 are pure imaginary. 

Case 1 

THEOREM 13.1 

Hypothesis. The roots X t and X 2 of the characteristic equation (13.22) are real , 
unequal , and of the same sign. 

Conclusion. The critical point (0, 0) of the linear system (13.19) is a node. 

Proof. We first assume that X t and X 2 are both negative and take X x < X 2 < 0. By 
Theorem 7.7 of Chapter 7 the general solution of the system (13.19) may then be 
written 


(13.23) 


x = c l A l e Xlt -f c 2 A 2 e Xzt , 

y = + c 2 B 2 e Xzt , 

where A l9 B l9 A 2 , and B 2 are definite constants and A t B 2 # A 2 B l9 and where c x and c 2 
are arbitrary constants. Choosing c 2 = 0 we obtain the solutions 

x = c^A^e x ^ 
y = 


(13.24) 
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choosing c i = 0 we obtain the solutions 


x = c 2 A 2 e Xl \ 
y = c 2 B 2 e Xlt . 


(13.25) 


For any c i > 0, the solutions (13.24) represent a path consisting of “half” of the line 
B t x = A t y of slope BJA V For any c t < 0, they represent a path consisting of the 
“other half” of this line. Since X t < 0, both of these half-line paths approach (0, 0) as 
t -► + oo. Also, since y/x = B t /A l9 these two paths enter (0, 0) with slope B i /A 1 . 

In like manner, for any c 2 > 0 the solutions (13.25) represent a path consisting of 
“half” of the line B 2 x = A 2 y; while for any c 2 < 0, the path so represented consists of 
the “other half” of this line. These two half-line paths also approach (0, 0) as t -► + oo 
and enter it with slope B 2 /A 2 . 

Thus the solutions (13.24) and (13.25) provide us with four half-line paths which all 
approach and enter (0, 0) as t -> + oo. 

If c t # 0 and c 2 # 0, the general solution (13.23) represents nonrectilinear paths. 
Again, since X t < X 2 < 0, all of these paths approach (0, 0) as t -> + oo. Further, since 

y c l B l e Xit + c 2 B 2 e Xlt _ (c l B l /c 2 )e (Xl ~ X2)t + B 2 
x c^A^e^ 1 + c 2 A 2 e Xlt (< c l A l /c 2 )e iXl ~ Xl)t + A 2 


we have 

um* = 5i 

r-ooX A 2 

and so all of these paths enter (0, 0) with limiting slope B 2 /A 2 . 

Thus all the paths (both rectilinear and nonrectilinear) enter (0, 0) as t -> + oo, and all 
except the two rectilinear ones defined by (13.24) enter with slope B 2 /A 2 . According 
to the definition of Section 13. IB, the critical point (0,0) is a node. Clearly, it is 
asymptotically stable. A qualitative diagram of the paths appears in Figure 13.7. 

If now X t and X 2 are both positive and we take X l > X 2 > 0, we see that the general 
solution of ( 1 3. 1 9) is still of the form ( 1 3.23) and particular solutions of the forms ( 1 3.24) 
and (13.25) still exist. The situation is the same as before, except all the paths approach 
and enter (0,0) as t-> — oo. The qualitative diagram of Figure 13.7 is unchanged, 
except that all the arrows now point in the opposite directions. The critical point (0, 0) is 
still a node , but in this case it is clear that it is unstable. 


Case 2 


THEOREM 13.2 

Hypothesis. The roots X t and X 2 of the characteristic equation (13.22) are real , 
unequal , and of opposite sign. 

Conclusion. The critical point (0, 0) of the linear system .(13.19) is a saddle point. 

Proof. We assume that X t < 0 < X 2 . The general solution of the system (13.19) 
may again be written in the form (13.23) and particular solutions of the forms (13.24) 
and (13.25) are again present. 

For any c l > 0, the solutions (13.24) again represent a path consisting of “half” the 
line B x x = A x y\ while for any c l < 0 , they again represent a path consisting of the 
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“other half” of this line. Also, since X t < 0, both of these half-line paths still approach 
and enter (0, 0) as t -► + oo. 

Also, for any c 2 > 0, the solutions (13.25) represent a path consisting of “half” the 
line B 2 x = A 2 y; and for any c 2 < 0, the path which they represent consists of the “other 
half” of this line. But in this case, since X 2 > 0, both of these half-line paths now 
approach and enter (0, 0) as t -> - oo. 

Once again, if / 0 and c 2 # 0, the general solution (13.23) represents non- 
rectilinear paths. But here since X t < 0 < X 2 , none of these paths can approach (0, 0) as 
t -► + oo or as t -> — oo. Further, none of them pass through (0, 0) for any t 0 such that 
— co<t 0 < + oo. As t -> + oo, we see from (13.23) that each of these nonrectilinear 
paths becomes asymptotic to one of the half-line paths defined by (13.25). As t — oo, 
each of them becomes asymptotic to one of the paths defined by (13.24). 

Thus there are two half-line paths which approach and enter (0, 0) as t -► + oo and 
two other half-line paths which approach and enter (0, 0) as t -► — oo. All other paths 
are nonrectilinear paths which do not approach (0, 0) as t -► + oo or as t -► — oo, but 
which become asymptotic to one or another of the four half-line paths as t -► + oo and 
as t -> - oo. According to the description and definition of Section 13. IB, the critical 
point (0, 0) is a saddle point. Clearly, it is unstable. A qualitative diagram of the paths 
appears in Figure 13.8. 

Case 3 

THEOREM 13.3 

Hypothesis. The roots X t and X 2 of the characteristic equation (13.22) are real and 
equal. 
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Conclusion. The critical point (0, 0) of the linear system (13.19) is a node. 

Proof. Let us first assume that X t = X 2 = X < 0. We consider two subcases: 
3(a) a = d^0,h = c = 0. 

3(b) All other possibilities leading to a double root. 

We consider first the special case 3(a). The characteristic equation (13.22) becomes 

X 2 — 2aX + a 2 = 0 


and hence X = a = d. The system (13.19) itself is thus simply 


dx 

~dt 


= Xx , 



The general solution of this system is clearly 


x — c±e^ 1 , 
y = c 2 e M , 


(13.26) 


where and c 2 are arbitrary constants. The paths defined by (13.26) for the various 
values of c { and c 2 are half-lines of all possible slopes. Since X < 0, we see that each of 
these half-lines approaches and enters (0, 0) as t -► + oo. That is, all the paths of the 
system enter (0, 0) as t -► + oo. According to the definition of Section 13. IB, the critical 
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point (0, 0) is a node. Clearly, it is asymptotically stable. A qualitative diagram of the 
paths appears in Figure 13.9. 

If A > 0, the situation is the same except that the paths enter (0, 0) as t -► — oo, the 
node (0, 0) is unstable , and the arrows in Figure 13.9 are all reversed. 

We mention that this type of node is sometimes called a star-shaped node. 

Let us now consider Case 3(b). Here the characteristic equation has the double root 
A < 0, but we exclude the special circumstances of Case 3(a). By Theorem 7.9 of 
Chapter 7 the general solution of the system (13.19) may in this case be written 

x = c x Ae Xt + c 2 (A l t + A 2 )e Xt , 

y = c x Be Xt + c 2 (B x t + B 2 )e x \ (13.27) 

where the A 9 s and B 9 s are definite constants, c t and c 2 are arbitrary constants, and 
B x /A x = B/A. Choosing c 2 = 0 in (13.27) we obtain solutions 


x = c x Ae Xt , 


y = c x Be Xt . 


(13.28) 


For any c x > 0, the solutions (13.28) represent a path consisting of “half” of the line 
Bx = Ay of slope B/A; for any c x < 0, they represent a path which consists of the 
“other half” of this line. Since A < 0, both of these half-line paths approach (0, 0) as 
t -> + oo. Further, since y/x = B/A , both paths enter (0, 0) with slope B/A. 

If c 2 # 0, the general solution (13.27) represents nonrectilinear paths. Since A < 0, we 
see from (13.27) that 


lim x = 0, lim y = 0. 

t-* + oo t-* + 00 

Thus the nonrectilinear paths defined by (13.27) all approach (0, 0) as t -> + oo. Also, 



Figure 13.9 
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since 

y _ c t Be Xt + c 2 (B Y t + B 2 )e Xt _ (c t B/c 2 ) + B 2 + B t t 
x c^Ae^ + c 2 (A l t + A 2 )e Xt (c i A/c 2 ) + A 2 + A t t’ 

we see that 


lim * = 

t~* + oo X 


Bi 

A i 


B 

A' 


Thus all the nonrectilinear paths enter (0, 0) with limiting slope B/A. 

Thus all the paths (both rectilinear and nonrectilinear) enter (0, 0) as t -► + oo with 
slope B/A. According to the definition of Section 1 3. 1 B, the critical point (0, 0) is a node . 
Clearly, it is asymptotically stable. A qualitative diagram of the paths appears in 
Figure 13.10. 

If X > 0, the situation is again the same except that the paths enter (0, 0) as t -+ — oo, 
the node (0, 0) is unstable , and the arrows in Figure 13.10 are reversed. 


Case 4 


THEOREM 13.4 

Hypothesis. The roots X i and X 2 of the characteristic equation (13.22) are conjugate 
complex with real part not zero ( that is, not pure imaginary). 

Conclusion. The critical point (0, 0) of the linear system (13.19) is a spiral point. 

Proof. Since X l and X 2 are conjugate complex with real part not zero, we may write 
these roots as a ± if, where a and f are both real and unequal to zero. Then by 



Figure 13.10 
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Theorem 7.8 of Chapter 7, the general solution of the system (13.19) may be written 
x = e at [c l (A l cos fit - A 2 sin pt) + c 2 (A 2 cos fit + A t sin /St)], 

(13.29) 

y = e at [c l (B l cos /St - B 2 sin pt) + c 2 (B 2 cos pt + B { sin fit)], 

where A t , A 2 , B u and B 2 are definite real constants and c t and c 2 are arbitrary 
constants. 

Let us first assume that a < 0. Then from (13.29) we see that 
lim x = 0 and lim y = 0 


t-* + oo 


t-> + c 


and hence all paths defined by (13.29) approach (0, 0) as t -► -f oo. We may rewrite 
(13.29) in the form 


x = e at (c 3 cos Pt + c 4 sin pt), 
e at (c 5 cos pt + c 6 sin Pt), 




(13.30) 


where c 3 = c l A l + c 2 A 2 , c 4 = c 1 A l - c l A 2 , c 5 = + c 2 B 2 , and c 6 = c 2 B x — 

c x B 2 . Assuming c x and c 2 are real, the solutions (13.30) represent all paths in the real xy 
phase plane. We may now put these solutions in the form 

x = K Y e™ cos (Pt + 0J, 

y = K 2 e at cos (Pt + 0 2 ), (13.31) 

where K i = ^Jc\ + c\, K 2 = + c\ and cf) l and 0 2 are defined by the equations 


i 

COS 0! = — , 




* C 5 

COS 0 2 = — , 

^2 


sin 0i = — 


<U_ 

*i’ 


sin 02 = — 




Let us now consider 


y _ K 2 e at cos (pt + 0 2 ) 
x K^e* 1 cos (Pt + 0 !)* 

Letting K = K 2 /K t and 0 3 = 0i — 0 2 , this becomes 

y = K cos(j 8 r + 0 ! - 0 3 ) 
x cos (Pt + 0 J 

cos(/fa + 0 t ) cos 03 + sin(/?t + 0 !) sin 0 3 


(13.32) 


= Kp 


COS(/?t + 

= K[cos <t> 3 + sin <p 3 tan (pt + c^)], 


(13.33) 


provided cos(j 8t + </;, ) # 0. As a result of the periodicity of the trigonometric functions 

y 

involved in (13.32) and (13.33) we conclude from these expressions that lim — does 

t~* + oo X 

not exist and so the paths do not enter (0, 0). Instead, it follows from (13.32) and (13.33) 
that the paths approach (0, 0) in a spiral-like manner, winding around (0, 0) an infinite 
number of times as t -► + oo. According to the definition of Section 13. IB, the critical 
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point (0, 0) is a spiral point. Clearly, it is asymptotically stable. A qualitative diagram of 
the paths appears in Figure 13.1 1. 

If a > 0, the situation is the same except that the paths approach (0, 0) as t -> — oo, 
the spiral point (0, 0) is unstable , and the arrows in Figure 13.1 1 are reversed. 


Case 5 

THEOREM 13.5 

Hypothesis. The roots and X 2 of the characteristic equation (13.22) are pure 
imaginary. 

Conclusion. The critical point (0, 0) of the linear system (13.19) is a center. 

Proof.- Since X x and X 2 are pure imaginary we may write them as a ± /?/, where 
a = 0 but is real and unequal to zero. Then the general solution of the system ( 1 3. 1 9) is 
of the form (13.29), where a = 0. In the notation of (13.31) all real solutions may be 
written in the form 


x = K x cos (ft -f </>!), 

y = K 2 cos(0f + 4> 2 ), (13.34) 

where K l ,K 2 ,(j) l , and <p 2 are defined as before. The solutions (1 3.34) define the paths in 
the real xy phase plane. Since the trigonometric functions in (1 3.34) oscillate indefinitely 
between + 1 as t -► + oo and as t -> - oo, the paths do not approach (0, 0) as t -► + oo 
or as t -► — oo. Rather it is clear from (1 3.34) that x and y are periodic functions of t and 


13.2 CRITICAL POINTS AND PATHS OF LINEAR SYSTEMS 


653 



hence the paths are closed curves surrounding (0, 0), members of which are arbitrarily 
close to (0, 0). Indeed they are an infinite family of ellipses with center at (0, 0). 
According to the definition of Section 13. IB, the critical point (0, 0) is a center. Clearly, 
it is stable. However, since the paths do not approach (0, 0), the critical point is not 
asymptotically stable. A qualitative diagram of the paths appears in Figure 13.12. 

We summarize our results in Table 13.1. The stability results of column 3 of this table 
lead at once to Theorem 13.6. 


THEOREM 13.6 


Consider the linear system 



+ by , 



+ dy. 


(13.19) 


where ad — be # 0, so that (0, 0) is the only critical point of the system. 

1. If both roots of the characteristic equation (13.22) are real and negative or 
conjugate complex with negative real parts , then the critical point (0,0) of (13.19) is 
asymptotically stable. 

2. If the roots of (13.22) are pure imaginary , then the critical point (0, 0) of (13.19) is 
stable , but not asymptotically stable. 

3. If either of the roots of (13.22) is real and positive or if the roots are conjugate 
complex with positive real parts , then the critical point (0, 0) of (13.19) is unstable. 
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TABLE 13.1 


Nature of roots X x and X 2 °f 
characteristic equation 

X 2 — (a + d)X + (ad — be) = 0 

Nature of critical point 
(0, 0) of linear system 

dx 

Ti = ax + by ’ 

dy 

- — = c :x + dy 
dt 

Stability of critical 
point (0, 0) 

real, unequal, and of same sign 

node 

asymptotically stable 
if roots are negative; 
unstable if roots are 
positive 

real, unequal, and of opposite 
sign 

saddle point 

unstable 

real and equal 

node 

asymptotically stable 
if roots are negative; 
unstable if roots are 
positive 

conjugate complex but not pure 
imaginary 

spiral point 

asymptotically stable 
if real part of roots is 
negative; unstable if 
real part of roots is 
positive 

pure imaginary 

center 

stable, but not asymp- 
totically stable 


B. Examples and Applications 
► Example 13.2 


Determine the nature of the critical point (0, 0) of the system 

d {= ix -*> 

and determine whether or not the point is stable. 


(13.35) 


Solution. The system (13.35) is of the form (13.19) where a = 2, b = —l,c = 3, and 
d = — 8. The characteristic equation (13.22) is 

X 2 + 62 + 5 = 0. 

Hence the roots of the characteristic equation are X { = — 5, X 2 = — 1. Since the roots 
are real, unequal, and of the same sign (both negative), we conclude that the critical 
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point (0,0) of (13.35) is a node. Since the roots are real and negative, the point is 
asymptotically stable. 


► Example 13.3 


Determine the nature of the critical point (0, 0) of the system 

dx 
~dt 

dy 


dt 


= 2x + 4 y, 

= - 2x + 6 y. 


(13.36) 


and determine whether or not the point is stable. 


Solution. Here a = 2, b = 4, c = — 2, and d = 6. The characteristic equation is 

X 2 - 82 + 20 = 0 

and its roots are 4 ± 2 i. Since these roots are conjugate complex but not pure 
imaginary, we conclude that the critical point (0, 0) of (13.36) is a spiral point. Since the 
real part of the conjugate complex roots is positive, the point is unstable. 


Application to Dynamics. Recall that in Chapter 5 we studied the free vibrations 
of a mass on a coil spring. The differential equation for the motion was 


d 2 x dx 

m + a — + kx = 0, 
dt 2 dt 


(13.37) 


where m > 0, a > 0, and k > 0 are constants denoting the mass, damping coefficient, 
and spring constant, respectively. We use a instead of a for the damping coefficient here 
to avoid confusion with the coefficient a of the system (13.19). 

The dynamical equation (13.37) is equivalent to the system 


dx 

li 


y > 


dy k a 

— = x y. 

dt m m 


(13.38) 


The solutions of the system (13.38) define the paths (phase trajectories) associated with 
the dynamical equation (13.37) in the xy phase plane. From (13.38) the differential 
equation of these paths is 

dy kx + ccy 
dx my 

We observe that (0, 0) is the only critical point of the system (13.38). The auxiliary 
equation of the differential equation (13.37) is 

mr 2 + ocr + k = 0, (13.39) 

while the characteristic equation of the system (13.38) is 


X 2 + — X + - = 0. 


m 


m 


(13.40) 
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TABLE 13.2 


Damping 
factor a 

Nature of 
roots of 
auxiliary and 
characteristic 
equations 

Form of solution of 
dynamical equation 

[k a 

01 = = 

m 2m 

Nature of 
critical point 
(0, 0) in xy 
phase plane 

Interpretation 

a = 0 (no 
damping) 

pure 

imaginary 

X = c cos (cot + </>) 

(stable) center 

oscillatory 
motion. Dis- 
placement 
and velocity 
are periodic 
functions of 
time 

a < lyjkm 
(under- 
damped) 

conjugate 
complex with 
negative real 
parts 

x = ce~ pt cos(a> 1 r+ 0), 
where 

oil = s/oi 2 - p 2 . 

asymptotically 
stable spiral 
point 

damped 
oscillatory 
motion. 
Displacement 
and velocity 
-►0 through 
smaller and 
smaller 
oscillations 

a = 2 y/km 

(critically 

damped) 

real, equal, 
and negative 

x = (c { + c 2 t)e~ pt 

asymptotically 
stable node 

displacement 
and velocity 
-►0 without 
oscillating 

a > 2 y/km 
(over- 
damped) 

real, unequal, 
and negative 

x = c { e rit + c 2 e ri \ 
where 

asymptotically 
stable node 

displacement 
and velocity 
—►0 without 
oscillating 


ri = ~P + s/P 2 -oi 2 . 




r 2 = -P-sfW ^ 1 

i 



The two equations (13.39) and (13.40) clearly have the same roots and A 2 . We 
summarize the possible situations in Table 13.2, which gives the form of the solution of 
the dynamical equation, the phase plane analysis of the critical point, and a brief 
interpretation. 


Exercises 


Determine the nature of the critical point (0, 0) of each of the linear autonomous 
systems in Exercises 1-8. Also, determine whether or not the critical point is stable. 


dx 

dt 


= X + 3y, 


dy 

dt 


= 3x + y. 
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2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 


dx 

li 

dx 

li 

dx 

dt 

dx 

dt 

dx 

li 

dx 

li 

dx 

dt 


3x + 2 y, 

dy 

j,- x+ 2y - 

3x + 4 y, 

^ = 3* + 2,. 
dt 

2x + 5y, 


2x - Ay, 


x - 2 y. 

+ 

II 

x-y. 


x + ly. 



9. Consider the linear autonomous system 


dx 

li 


= x 


+ * 



y- 


(a) Determine the nature of the critical point (0, 0) of this system. 

(b) Find the general solution of this system. 

(c) Find the differential equation of the paths in the xy plane and obtain the 
general solution of this differential equation. 

(d) Using the equation of the family of paths [the general solution obtained in 
part (c)], carefully plot the paths which pass through each of the following 
points: (1, 1), (- 1, - 1), (1, -3), (-1, 3), (0, 2), (0, -2), (0, 4), (0, -4), (1, 
- 1), ( — 1, 1), (2, —2), ( — 2, 2). Does the configuration thus obtained agree 
with your answer to part (a)? Comment. 


10. Consider the linear autonomous system 

dx 


dt 


= ax -f by. 


dy , 

— = cx + dy, 
dt 


(A) 


where a, b , c, and d are real constants. 

(a) Show that if a = d, and b and c are of opposite sign, then the critical point 
(0, 0) of system (A) is a spiral point. 

(b) Show that if a = d, and either borc = 0, then the critical point (0, 0) of (A) is 
a node. 
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(c) Show that if a = d, and b and c are of the same sign and such that y/bc <\a\, 
then the critical point (0, 0) of (A) is a node. 

(d) Show that if a = d, and b and c are of the same sign and such that yjbc > | a |, 
then the critical point (0, 0) of (A) is a saddle point. 

(e) Show that if a = — d and ad > be, then the critical point (0, 0) of (A) is a 
center. 

(f) Show that if a = —d, and b and c are of the same sign, then the critical point 
(0, 0) of (A) is a saddle point. 


13.3 CRITICAL POINTS AND PATHS OF NONLINEAR SYSTEMS 

A. Basic Theorems on Nonlinear Systems 

We now consider the nonlinear real autonomous system 

dx 


dt 


= P(x, y). 


is-®*'* 


(13.4) 


We assume that the system (13.4) has an isolated critical point which we shall choose 
to be the origin (0, 0). We now assume further that the functions P and Q in the right 
members of (13.4) are such that P(x, y) and Q(x, y) can be written in the form 


P(x, y) = ax + by + P t (x, y), 
Q(x, y) = cx + dy + Q t (x, y), 


(13.41) 


where 

1. a, b, c, and d, are real constants, and 


a b 
c d 


^ 0, 


and 


2. P t and Q x have continuous first partial derivatives for all (x, y), and are such that 


lim 


PM, y) 


= lim 


QM, y) 


(x,y)^(0,0) ^jx 2 + y 2 <x, JO-MO, 0) y Jx 2 ~+~y‘ 

Thus the system under consideration may be written in the form 

dx _ _ . 

= ax + by + Pi(x, y), 


= 0 . 


(13.42) 


dt 
dy 

— = cx + dy + Q i(x, Ji), 


(13.43) 


where a, b, c, d, P t , and Q i satisfy the requirements (1) and (2) above. 
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If P(x, y)andQ(x, y) in (13.4) can be expanded in power series about (0, 0), the system 
(13.4) takes the form 


y + a i2 x 2 + a 22 xy + a 2l y + •••, 


dx _[~dPl pPl 

dt L 5x J(0,0) L^J« 

r dQl fdQl , , , , , 

= TT x + 7 - y + b l2 x 2 + b 22 xy + b 2l y 2 + ■■■. 
L C7X J(0,0) L ^-1(0,0) 


d 2_ 

dt 


(13.44) 


This system is of the form (13.43), where P t (x, y) and Q t (x, y) are the terms of higher 
degree in the right members of the equations. The requirements (1) and (2) will be met, 
provided the Jacobian 


d(P, Q) 


d(x, y) 


¥> 0 . 


( 0 , 0 ) 


Observe that the constant terms are missing in the expansions in the right members of 
(13.44), since P( 0, 0) = Q( 0, 0) = 0. 

► Example 1 3.4 

The system 


= x + 2y + x 2 , 


dx 
dt 

j t = -lx-4y + 2y 2 

is of the form (13.43) and satisfies the requirements (1) and (2). Here a = 1, b = 2, 
c = —3 ,d= —4, and 


a b 
c d 


2 * 0 . 


Further P { (x, y) = 

x 2 , Qi(x, y) = 2 y 2 , and hence 


lim P ^±l± 

= lim 


(x.rt^(o.o) yj x 2 + y 2 

(x,y)^(0,0) 

and 


lim e j£± 

= lim 


(x,y)-(0,0) ^JX 2 + y 2 

(x,y)-+(0,0) 


= 0 


2>’ 2 


= 0 . 


By the requirement (2) the nonlinear terms P t (x, y) and Q t {x, y) in (13.43) tend to zero 
more rapidly than the linear terms ax + by and cx + dy. Hence one would suspect that 
the behavior of the paths of the system (1 3.43) near (0, 0) would be similar to that of the 
paths of the related linear system 
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obtained from (13.43) by neglecting the nonlinear terms. In other words, it would seem 
that the nature of the critical point (0, 0) of the nonlinear system (13.43) should be 
similar to that of the linear system (13.19). In general this is actually the case. We now 
state without proof the main theorem regarding this relation. 


THEOREM 13.7 


Hypothesis. Consider the nonlinear system 


— = ax + by + P t (x, y), 
dy 

— = cx + dy + <2,(x, y), 


(13.43) 


where a, b, c, d, P u and Q t satisfy the requirements (1) and (2) above. Consider also the 
corresponding linear system 



+ by, 



+ dy. 


(13.19) 


obtained from (13.43) by neglecting the nonlinear terms P t (x, y) and Qfx, y). Both 
systems have an isolated critical point at (0, 0). Let X Y and X 2 be the roots of the 
characteristic equation 

X 2 -(ay- d)X + (ad -bc) = 0 (13.22) 

of the linear system (13.19). 


Conclusions 

1. The critical point (0, 0) of the nonlinear system (13.43) is of the same type as that of 

the linear system (13.19) in the following cases: 

(i) If X Y and X 2 are real, unequal, and of the same sign, then not only is (0, 0) a node of 
(13.19), but also (0, 0) is a node of (13.43). 

(ii) If X x and X 2 are real, unequal, and of opposite sign, then not only is (0, 0) a saddle 
point of (13.19), but also (0, 0) is a saddle point of (13.43). 

(iii) If X x and X 2 are real and equal and the system (13.19) is not such that a = d / 0, 
b = c = 0, then not only is (0, 0) a node of (13.19), but also (0, 0) is a node of (13.43). 

(iv) If X t and X 2 are conjugate complex with real part not zero, then not only is (0, 0) a 
spiral point of (13.19), but also (0, 0) is a spiral point of (13.43). 

2. The critical point (0, 0) of the nonlinear system (13.43) is not necessarily of the same 

type as that of the linear system (13.19) in the following cases: 

(v) If X l and X 2 are real and equal and the system (13.19) is such that a = d / 0, b = 
c = 0, then although (0, 0) is a node of (13.19), the point (0, 0) may be either a node or 
a spiral point of (13.43). 
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Figure 13.13a Linear system. 


Figure 13.136 Nonlinear system. 


(vi) If and X 2 are pure imaginary , then although (0, 0) is a center of (13.19), the point 

(0, 0) may be either a center or a spiral point of (13.43). 

Although the critical point (0, 0) of the nonlinear system (13.43) is of the same type as 
that of the linear system (13.19) in cases (i), (ii), (iii), and (iv) of the conclusion, the actual 
appearance of the paths is somewhat different. For example, if (0, 0) is a saddle point of 
the linear system (13.19), then we know that there are four half-line paths entering (0, 0), 
two for t + oo and two for r — oo. However, at the saddle point of the nonlinear 
system (13.43), in general we have four nonrectilinear curves entering (0,0), two for 
t-+ - (-oo and two for t -► — oo, in place of the half-line paths of the linear case (see 
Figure 13.13). 

Theorem 13.7 deals with the type of the critical point (0, 0) of the nonlinear system 
(13.43). Concerning the stability of this point, we state without proof the following 
theorem of Liapunov. 


THEOREM 13.8 

Hypothesis. Exactly as in Theorem 13.7. 

Conclusions 

1. If both roots of the characteristic equation (13.22) of the linear system (13.19) are 
real and negative or conjugate complex with negative real parts, then not only is ( 0 , 0 ) an 
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asymptotically stable critical point of (13.19) but also (0, 0) is an asymptotically stable 
critical point of (13.43). 

2. If the roots of (13.22) are pure imaginary , then although (0, 0) is a stable critical 
point of (13.19), it is not necessarily a stable critical point of (13.43). Indeed, the critical 
point (0, 0) of (13.43) may be asymptotically stable, stable but not asymptotically stable, 
or unstable. 

3. If either of the roots of (13.22) is real and positive or if the roots are conjugate 
complex with positive real parts, then not only is (0, 0) an unstable critical point of (13.19), 
but also (0, 0) is an unstable critical point of (13.43). 


► Example 13.5 

Consider the nonlinear system 


dx A 2 

-=X + 4 ,-X, 

rfy (1145) 

— = 6 X- y + 2xy. 


This is of the form (13.43), where P, (x. v) = — x 2 and Q t (x, v) = 2 xy. We see at once 
that the hypotheses of Theorems 13.7 and 13.8 are satisfied. Hence to investigate the 
critical point (0, 0) of (13.45), we consider the linear system 


dx 

- = * + 4 ,, 



y> 


(13.46) 


of the form (13.19). The characteristic equation (13.22) of this system is 

X 2 - 25 = 0. 


Hence the roots are X i = 5, X 2 = - 5. Since the roots are real, unequal, and of opposite 
sign, we see from Conclusion (ii) of Theorem 13.7 that the critical point (0, 0) of the 
nonlinear system (13.45) is a saddle point. From Conclusion (3) of Theorem 13.8 we 
further conclude that this point is unstable. 

Eliminating dt from Equations (13.45), we obtain the differential equation 


dy _ 6x — y -F 2 xy 
dx x + Ay — x 2 


(13.47) 


which gives the slope of the paths in the xy phase plane defined by the solutions of 
(13.45). The first-order equation (13.47) is exact. Its general solution is readily found to 
be 


x 2 y -f 3x 2 — xy — 2 y 2 + c = 0, (13.48) 

where c is an arbitrary constant. Equation (13.48) is the equation of the family of paths 
in the xy phase plane. Several of these are shown in Figure 13.14. 
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► Example 1 3.6 

Consider the nonlinear system 


Using the expansion 


dx 

It 


sin x — 4 y. 


dy 

dt 


sin 2x — 5 y. 


sin x = 



we write this system in the form 

dx 
dt 

dy 


— x 


dt 


= 2x ■ 


A x 3 x 5 

4j '“T + 120 + '"’ 

4x 3 4x 5 

~ 3 T5 


(13.49) 


(13.50) 


The hypotheses of Theorems 13.7 and 13.8 are satisfied. Thus to investigate the critical 
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point (0, 0) of (13.49) [or (13.50)], we consider the linear system 


dx 

li 


= x -4 y. 


dy 

dt 


= 2x — 5 y. 


(13.51) 


The characteristic equation of this system is 

X 2 + 4X + 3 = 0. 

Thus the roots are X Y = — 3, X 2 = — 1. Since the roots are real, unequal, and of the 
same sign, we see from Conclusion (i) of Theorem 13.7 that the critical point (0, 0) of the 
nonlinear system (13.49) is a node. From Conclusion (1) of Theorem 13.8 we conclude 
that this node is asymptotically stable. 


► Example 13.7 


In this example we shall find all the real critical points of the nonlinear system 



-r 


dy 

dt 


— 6y + 6x 2 , 


(13.52) 


and determine the type and stability of each of these critical points. 

Clearly (0, 0) is one critical point of system (13.52). Also observe that (13.52) is of the 
form (13.43) and that the hypotheses of Theorems 13.7 and 13.8 are satisfied. To 
determine the type of critical point (0, 0), we consider the linear system 


dx 

li 


= 8x, 


dy 

dt 


= -6y, 


of the form (13.19). The characteristic equation of this linear system is 

A 2 - 2X - 48 = 0, 


and thus the roots are X t = $ and X 2 = —6. Since the roots are real, unequal, and of 
opposite sign, we see from Conclusion (ii) of Theorem 13.7 that the critical point (0, 0) 
of the given nonlinear system (13.52) is a saddle point. From Conclusion (3) of 
Theorem 13.8 we conclude that this critical point is unstable. 

We now proceed to find all other critical points of (13.52). By definition, the critical 
points of this system must simultaneously satisfy the system of algebraic equations 


8 x — y 2 = 0, 
— 6y + 6x 2 = 0. 


(13.53) 


From the second equation of this pair, y = x 2 . Then, substituting this into the first 
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equation of the pair, we obtain 

8x — x 4 = 0, 

which factors into 

x(2 — x)(4 + 2x + x 2 ) = 0. 

This equation has only two real roots, x = 0 and x = 2. These are the abscissas of the 
real critical points of (13.52); the corresponding ordinates are determined from y = x 2 . 
Thus we obtain the two real critical points (0, 0) and (2, 4). 

Since we have already considered the critical point (0, 0) and found that it is an 
(unstable) saddle point of the given system (13.52), we now investigate the type and 
stability of the other critical point (2, 4). To do this, we make the translation of 
coordinates 

f = x - 2, 

. (13.54) 

rj = y — 4, 

which transforms the critical point x = 2, y — 4 into the origin £ = rj = 0 in the £rj 
plane. We now transform the given system (13.52) into (£, rj) coordinates. From (13.54), 
we have 


x = £ + 2, y = >7 + 4; 

and substituting these into (13.52) and simplifying, we obtain 




j t = 24£ - 6r, + 6£ 2 , 


(13.55) 


which is (13.52) in ({, rj) coordinates. The system (13.55) is of the form (13.43) and the 
hypotheses of Theorems 13.7 and 13.8 are satisfied in these coordinates. To determine 
the type of the critical point £ = rj = 0 of (13.55), we consider the linear system 


dt 

dt 


= H 


drj 

It 


= 24£ - 6 rj. 


The characteristic equation of this linear system is 

X 2 - 2X + 144 = 0. 

The roots of this system are 1 ± x /l43 i, which are conjugate complex with real part not 
zero. Thus by Conclusion (iv) of Theorem 13.7, the critical point £ = rj = 0 of the 
nonlinear system (13.55) is a spiral point. From Conclusion (3) of Theorem 13.8, we 
conclude that this critical point is unstable. But this critical point is the critical point 
x = 2, y = 4 of the given system (13.52). Thus the critical point (2, 4) of the given sys- 
tem (13.52) is an unstable spiral point. 

In conclusion, the given system (13.52) has two real critical points, namely: 

1. critical point (0, 0); a saddle point; unstable; 

2. critical point (2, 4); a spiral point; unstable. 
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► Example 1 3.8 


Consider the two nonlinear systems 


and 


dx 

~di 


-y - x 2 . 



dx 

It 


-y 



(13.56) 


(13.57) 


The point (0, 0) is a critical point for each of these systems. The hypotheses of 
Theorem 13.7 are satisfied in each case, and in each case the corresponding linear 
system to be investigated is 

dx 

Tt = ~ y ' 

( 13 - 5g ) 


The characteristic equation of the system (13.58) is 

A 2 + 1 = 0 

with the pure imaginary roots ± i. Thus the critical point (0, 0) of the linear system 
(13.58) is a center. However, Theorem 13.7 does not give us definite information 
concerning the nature of this point for either of the nonlinear systems ( 1 3.56) or ( 1 3.57). 
Conclusion (vi) of Theorem 13.7 tells us that in each case (0, 0) is either a center or a 
spiral point; but this is all that this theorem tells us concerning the two systems under 
consideration. 

Hurewicz ( Lectures on Ordinary Differential Equations , p. 99) shows that the critical 
point (0, 0) of the system ( 1 3.56) is a center , while the same point for the system ( 1 3.57) is 
a spiral point. 


Suppose we consider a nonlinear real autonomous system of the form 

£ = 2<*.A 

where no linear terms are present and P and Q are of the respective forms 

P(x, y) = Ax 2 + Bxy + Cy 2 
Q{x , y) = Dx 2 + Exy + Fy 2 . 


and 
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Then we find that the origin (0, 0) will be one of several types of critical point which are 
of a more complicated nature than any of the simple types that we have previously 
encountered. 


► Example 1 3.9 


Two systems of the preceding type are given by 



and 


dy 

dt 


= 3 y 2 - x 2 . 



(13.59) 


dy 

dt 


= 2 y 2 


-xy. 


(13.60) 


Qualitative portraits of the paths in the neighborhood of the critical point (0, 0) for the 
systems (13.59) and (13.60) are shown in Figures 13.15 and 13.16, respectively. 

The systems of this example and their corresponding phase plane portraits in the 
neighborhood of their common critical point (0, 0) should be sufficient to suggest that a 
wide variety of critical points is possible for various types of more general nonlinear 
systems. Indeed, for the general nonlinear autonomous system 


dx 

m =P{x - A 

dy 

{ r Q( X ,y), 


(13.4) 




Figure 13.15 


Figure 13.16 
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an infinite number of possible types of critical points exists, and, depending upon the 
nature of the nonlinear terms in (13.4), there are an infinite number of types which are 
incredibly complicated. 


B. Nonlinear Conservative Systems 


Let us consider a dynamical system in which the dissipation of energy is so slow that it 
may be neglected. Thus neglecting this slow dissipation, we assume the law of 
conservation of energy, namely, that the sum of the kinetic energy and the potential 
energy is a constant. In other words, the system is regarded as conservative. 

Specifically, we consider a particle of mass m in rectilinear motion under the action 
of restoring force F which is a function of the displacement x only. The differential 
equation of motion is thus 


m 


d 2 x 

dF 


= F(x), 


(13.61) 


where we assume that F is analytic for all values of x. The differential equation (13.61) is 
equivalent to the nonlinear autonomous system 


dx 

Tt= y ' 


dy 

dt 


F(x) 


m 


(13.62) 


Eliminating dt we obtain the differential equation of the paths defined by the 
solutions of (13.62) in the xy phase plane: 


dy_ 

dx 


m 

my 


(13.63) 


Separating variables in this equation, we obtain 

my dy = F(x) dx. 

Suppose x = x 0 and y( = dx/dt) = y 0 at t = t 0 . Then, integrating, we find 


imy 2 - %myl 


-f 


F(x) dx 


or 


\my 2 


r 


F(x) dx - jmyl - 


F(x) dx. 


(13.64) 


But jm(dx/dt) 2 = \my 2 is the kinetic energy of the system and 


V(x) = 


F(x) dx 


is the potential energy. Thus Equation (13.64) takes the form 

|my 2 + V(x) = h. 


(13.65) 
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where the constant 


h = ±myl - 


*X0 

F(x) dx = 2 m yo + F(x 0 ) ^ 0 

0 


is the total energy of the system. 

Since Equation (13.65) was obtained by integrating (13.63), we see that (13.65) gives 
the family of paths in the xy phase plane. For a given value of h , the path given by 
(13.65) is a curve of constant energy in this plane. In other words, along a particular 
path the total energy of the systemds a constant; this expresses the law of conservation 
of energy. 

The critical points of the system (13.62) are the points with coordinates (x c , 0), where 
x c are the roots of the equation F(x) = 0. These are equilibrium points of the given 
dynamical system. From the differential equation (13.63) we see that the paths cross 
the x axis at right angles and have horizontal tangents along the lines x = x c . From 
Equation (13.65) we observe that the paths are symmetrical with respect to the x axis. 

From the equation (13.65) of the paths we find at once 


This may be used to construct the paths in the following convenient way. 


(13.66) 


1. On a rectangular xY plane construct the graph of Y = V(x) and the lines Y = h, for 
various values of h [see Figure 13.17a, where one such line is shown]. For each 
fixed value h this shows the difference 6 — V(x) for each x. 

2. Directly below the xY plane draw the xy phase plane, with the y axis on the same 
vertical line as the Y axis [see Figure 13.176]. For each x multiply the difference 
h — V(x) obtained from the graph in Step (1) by the constant factor 2/m and 
use (13.66) to plot the corresponding y value on the phase plane. 


We observed that the critical points of the system (13.62) are the points (x c , 0) such 
that F(x c ) = 0. Now note that F(x) = — V'(x), and recall that V'(x c ) = 0 implies that 
V(x) has either a relative extremum or a horizontal inflection point at x = x c . We thus 
conclude that all the critical points of (13.62) lie on the x axis and correspond to points 
where the potential energy V(x) has a relative minimum, a relative maximum, or a 
horizontal inflection point. We now discuss each of these three possibilities, using the 
graphical technique just introduced. 


1. Suppose V(x) has a relative minimum at x = x c (see Figure 13.18). 


Let V(x c ) = h 0 > 0, and consider the path corresponding to a typical value h t of h 
such that h t > h 0 . For h = h l9 the values of y given by (13.66) are real for a x < x < b l 
and equal to zero at both x = a t and x = b t . Hence, using the symmetry of the paths, 
we see that the path corresponding to h = h t is a closed ellipse-like curve surrounding 
the critical point (x c , 0). This is typical of all paths corresponding to all h > h 0 and 
sufficiently near to the critical point (x c , 0). Thus, we conclude that if the potential 
energy V(x) has a relative minimum at x = x c , then the critical point (x c , 0) is a center 
and so is stable. 


2. Suppose V(x) has a relative maximum at x = x c (see Figure 13.19). 
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Let V{x c ) = h 0 > 0, and consider the paths corresponding to a typical value h Y oi h 
such that h t < h 0 . For h — h t , the values of y given by (13.66) are real both for x <a l 
(and sufficiently close to x = a t ) and for x> b t (and sufficiently close to x = b x ); the 
value of y is equal to zero at both x = a Y and x = b t . Thus for x < a i (and sufficiently 
close to x = a L ), the path corresponding to h = h t is a hyperbolic-like curve opening to 
the left of x = a l9 whereas, for x > b t (and sufficiently close to x = b l ) the path 
corresponding to h = h x is a hyperbolic-like curve opening to the right of x = b t . 

Now consider the paths corresponding to a typical value h 2 of h such that h 2 > h 0 . 
For h = h 2 , the values of y given by (13.66) are real for all x sufficiently close to x c . 
Moreover, the positive value of y has a relative minimum (say m) corresponding to 
x = x c , and the negative value has a relative maximum (which would then be 
— m) corresponding to x = x c . Thus for all x sufficiently close to x c , the paths 
corresponding to h = h 2 are a pair of hyperbolic-like curves, one opening upward 
above (x c , m) and the other opening downward below (x c , —m). 
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Finally, consider the paths corresponding to h = h 0 itself. For h = h 0 , the value of y 
given by (13.66) for x = x c is zero. The values of y given by (13.66) for x / x c (and 
sufficiently close to x = x c ) are real and of opposite sign. Thus, there are two geometric 
curves which cross the x axis at x = x c , and these two curves provide four paths which 
approach and enter the critical point (x c , 0), two as t -► + oo and two as t -► — oo. From 
the discussion in this and the two preceding paragraphs, we conclude the following. If 
the potential energy V(x) has a relative maximum at x = x c , then the critical point 
(x c , 0) is a saddle point and so is unstable. 

3. Suppose V(x) has a horizontal inflection point at x = x c (see Figure 13.20). 

To be definite, suppose V(x) is strictly increasing for all x / x c (and sufficiently close 
to x c ); and let V(x c ) = h 0 > 0. Then, proceeding as in the discussions of cases 1 and 2, 
we find that the paths (for x sufficiently close to x c ) have the appearance shown in the 
phase plane part of Figure 13.20. For typical h / h 0 (say h t < h 0 or h 2 > h 0 ), the paths 
merely have x-axis symmetry and intersect that axis at right angles. For h = h 0 , the 
symmetry remains, but the path has a so-called cusp as it passes through the critical 
point (x c , 0) (see Figure 13.20 again). 

The critical point (x c , 0) is a “degenerate” type of critical point and is unstable. The 
case in which V(x) is strictly decreasing (instead of increasing) is analogous. 

We have presented a graphical treatment of the situation. We point out that an 
analytic treatment, using principles of elementary calculus, could also be employed to 
obtain the same results. We summarize these results as the following theorem. 




Figure 13.20 
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THEOREM 13.9 


Let 


d 2 x 


(13.61) 


where F is analytic for all values of x , be the differential equation of a conservative 
dynamical system. Consider the equivalent autonomous system 

dx 

i; = y ’ 


dy _ F(x) 
dt m 


(13.62) 


and let (x c , 0) be a critical point of this system. Let V be the potential energy function of 
the dynamical system having differential equation (13.61); that is, V is the function defined 
by 

K(x) = -J* F(x) dx. 

Then we conclude the following: 

1. If the potential energy function has a relative minimum at x = x c , then the critical 
point (x c , 0) is a center and is stable. 

2. If the potential energy function has a relative maximum at x = x c , then the critical 
point (x c , 0) is a saddle point and is unstable. 

3. If the potential energy function has a horizontal inflection point at x = x c , then the 
critical point (x c , 0) is of a “ degenerate ” type called a cusp and is unstable. 


► Example 13.10 


Suppose F(x) = 4x 3 - 4x and m = 1. Then the differential equation (13.61) of motion 
is 


d 2 x 

It 2 


= 4x 3 — 4x. 


The equivalent nonlinear system (13.62) is 

dx 

Tt =y ' 

^_ iY 3 


(13.67) 


dt 


= 4x 3 — 4x. 


The critical points of this system are (0, 0), (1, 0), and (— 1, 0). The differential equation 
(13.63) of the paths in the xy phase plane is 

dy 4x 3 — 4x 
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Integrating this we obtain the family of paths 

jy 2 — x 4 + 2x 2 = h. (13.68) 

This is of the form (13.65), where 

F(x) = - J F(x) dx = -x 4 + 2x 2 (13.69) 

is the potential energy. 

We now construct the xy-phase-plane diagram by first constructing the curve Y = 
V(x) and the lines Y = h for various values of h in the xY plane [see Figure 13.21a]. 
From this xY plane we read off the values h — V(x) = h + x 4 — 2x 2 and plot the paths 
in the xy phase plane directly below [see Figure 13.215] using 

y= ± n /2(/i + x 4 — 2x 2 ). 

From the formula 

V{x)= — x 4 + 2x 2 

for the potential energy, we find 

V'(x) = -4x 3 + 4x = — 4x(x + l)(x - 1) 
and 

K"(x) = -12x 2 + 4. 

Thus we see that V'(x) = 0 at x = — 1, x = 0, and x = 1. We also observe that V"(— 1) 
< 0, F"(0) > 0, F"(l) < 0. Thus the potential energy function V has a relative 
minimum at x = 0 and relative maxima at x = — 1 and x = 1. Hence, by Theorem 13.9, 
the critical point (0, 0) is a center and is stable and the critical points ( — 1, 0) and (1,0) 
are saddle points and are unstable. 

From Figure 13.21 we see that if the total energy h is less than 1 and x is initially 
between — 1 and + 1, then the corresponding paths are closed and hence the motion is 
periodic. If, however, h > 1 , the paths are not closed and the motion is not periodic. The 
value h = 1 clearly separates the motions of one type (periodic) from those of another 
(nonperiodic). For this reason the corresponding path is called a separatrix. We observe 
the nature of the critical points from the phase plane diagram. The point (0, 0) is clearly 
a center , while the points (1,0) and ( — 1, 0) are saddle points. 

► Example 13.11 

Consider a simple pendulum which is composed of a mass m (the bob) at the end of a 
straight wire of negligible mass and length /. Suppose this pendulum is suspended from 
a fixed point S and is free to vibrate in a vertical plane. Let x denote the angle which the 
straight wire makes with the vertical ray from S downward at time t (see Section 5.2, 
Exercise 8, and Figure 5.4, where 6 there has been replaced by x here). We neglect air 
resistance and assume that the only forces acting on the mass m are the tension in the 
wire and the force due to gravity. Then the differential equation of the displacement of 
the pendulum is 

9 . 

= -- sin x, 


d 2 x 

It 1 
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(b) 

Figure 13.21 
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where g is the acceleration due to gravity. For simplicity, we let g = / = 1, and consider 
the special case 

d 2 x 

-r^r = — sin x. 
dt 2 

This differential equation is of the form (13.61), where m = 1 and F(x) = —sin x. 

The equivalent nonlinear system of the form (13.62) is 


dx 



dy 

— = — sin x. 
dt 

The critical points of this system are the points (x c ,0), where x c are the roots of 
sin x = 0. Thus the critical points are the infinite set of points 

(nn 9 0), where n = 0, ±1, + 2, . . . . 

The differential equation of the paths in the xy phase plane is 

dy _ sin x 
dx y 

Integrating, we obtain the family of paths 

\y 2 = cos X + C 


or 


jy 2 + (1 — cos x) = h 9 
where jy 2 is the kinetic energy, 


V(x) = 


F(x) dx = 
o 


sin x dx = 1 — cos x 
o 


is the potential energy, and h = c + 1 is the total energy of the system. 

We may now construct the xy-phase-plane diagram by first constructing the curve 
Y = V(x) and the lines Y = h for various values of h > 0 in the x Y plane [see 
Figure 13.22a]. From this xY plane, we read off the values h — V(x) = h + cos x — 1 
and plot the paths in the xy phase plane directly below [see Figure 13.22b] using 

y = ±y/2(h + cos x — 1). 

From the formula 


V(x) = 1 — cos x 

for the potential energy, we find 

V\x) = sin x and V"(x) = cos x. 

Thus we see that V'(x) = 0 at x = nn 9 where n = 0, ±1, ±2,. . . . We also observe that 
V"(nn) > 0 if n is even and V"(nn) < 0 if n is odd. Thus the potential Qnergy function V 
has a relative minimum at x = 2nn for every even integer In (n = 0, ±1, ± 2, . . . ) and a 
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Figure 13.22 


relative maximum at x = (2 n + \)n for every odd integer 2n + 1 (n = 0, ±1, ±2,...). 
Hence, from Theorem 13.9 we conclude the following: The critical points 

{2nn, 0), n — 0, ± 1 , ± 2, . . . , are centers 

and are stable; and the critical points 

[(2 n+ l)7c, 0], n = 0, ±1, ±2,..., are saddle points 

and are unstable. 

From Figure 13.22 we see that if the total energy h is less than 2, then the 
corresponding paths are closed. Each of these paths surrounds one of the centers 
(2wr,0). Physically, all of these (stable) centers correspond to exactly one physical 
state, namely, the pendulum at rest with the bob in its lowest position (the stable equi- 
librium position). Thus, each closed path about a center corresponds to a periodic 
back-and-forth oscillatory motion of the pendulum bob about its lowest position. 

On the other hand, if the total energy h is greater than 2, then the corresponding 
paths are not closed. Clearly x->+ooast->+ooify = dx/dt > 0, and x -► — oo as 
t -► + oo if y = dx/dt < 0. Thus the motion corresponding to such a path does not 
define x as a periodic function of t. Nevertheless, physically, the corresponding motion 
is periodic. To see this, we note the following: All of the (unstable) saddle points 
[(2 n + l)7c, 0] correspond to exactly one physical state, namely, the pendulum at rest 
with the bob in its highest position (the unstable equilibrium position); and each of the 
nonclosed paths corresponds to a physically periodic round-and-round motion of the 
pendulum about its point of support. 
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The total energy value h = 2 separates the back-and-forth type of motion from the 
round-and-round type. The paths corresponding to h = 2 are the ones which enter the 
saddle points as t -► + oo or t -► — oo, and are the separatrices. 


Dependence on a Parameter. We briefly consider the differential equation of a 
conservative dynamical system in which the force F depends not only on the 
displacement x but also on a parameter X, Specifically, we consider a differential 
equation of the form 


d 2 x 

Tt 


TT = F(x, A), 


(13.70) 


where F is analytic for all values of x and X. The differential equation (13.70) is 
equivalent to the nonlinear autonomous system 


dx 


(13.71) 


For each fixed value of the parameter X, the critical points of (13.71) are the points with 
coordinates (x c , 0), where the abscissas x c are the roots of the equation F(x, X) = 0, 
considered as an equation in the unknown x. In general, as X varies continuously 
through a given range of values, the corresponding x c vary and hence so do the 
corresponding critical points, paths, and solutions of (13.71). A value of the parameter 
X at which two or more critical points coalesce into less than their previous number (or, 
vice versa, where one or more split up into more than their previous number) is called a 
critical value (or bifurcation value) of the parameter. At such a value the nature of the 
corresponding paths changes abruptly. In determining both the critical values of the 
parameter and the critical points of the system (13.71), it is often very useful to 
investigate the graph of the relation F(x, X) = 0 in the xX plane. 


► Example 13.12 

Consider the differential equation 


d 2 x 

IF 


= x 2 — 4x + X 


of the form (13.70), where 

F(x, X) = x 2 — 4x + X 

and A is a parameter. The equivalent nonlinear system of the form (13.71) is 

dx 


(13.72) 

(13.73) 


dt 

dy 

dt 


= y> 


= x 2 — 4x 4- X. 


(13.74) 


The critical points of this system are the points (x l9 0) and (x 2 , 0), w'here x x and x 2 are 
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the roots of the quadratic equation F(x , X) = 0; that is, 


in the unknown x. We find 


x 2 — Ax + X = 0, 

. 4±yi6-‘ u _ 2± ^-x 


(13.75) 


Thus the critical points of (13.74) are 

(2 + 74 - A, 0) and (2 - - A, 0). (13.76) 

For A < 4, the roots 2 + ^/4 — A of the quadratic equation (13.75) are real and 
distinct; for 1 = 4, the roots are real and equal, the common value being 2; and for 
1 > 4, they are conjugate complex. Thus for 1 < 4, the critical points (13.76) are real 
and distinct. As l->4—, the two critical points approach each other; and at 1 = 4, 
they coalesce into the one single critical point (2, 0). For 1 > 4, there simply are no real 
critical points. Thus we see that 1 = 4 is the critical value of the parameter. 

We consider the three cases introduced in the preceding paragraph. 

1. 1 < 4. For each fixed value X 0 of 1 such that 1 0 < 4, the critical points of (13.74) 
are the real distinct points with coordinates 

(2 + V4 — A o ,0) and (2 - ^/4 - A 0 , 0). 

For each such 1 0 , the potential energy function V is the function of x only defined by 

f* f* x ^ 

K(x, X 0 ) = — F(x, X 0 ) dx = — (x 2 — 4x + X 0 ) dx = — — 4- 2x 2 — l 0 x. 

Jo Jo 3 

Then 


V'(x, X 0 ) = — F(x, X 0 ) = —x 2 + 4x — X 0 and K"(x, 1 0 ) = — 2x + 4. 
Let x = 2 — ^4 — 1 0 . Then 

V(2 - V4 - A 0 , A 0 ) = 0 and K"(2 - ^4 - A 0 , A 0 ) > 0. 


Thus the potential ener gy K(x, 1 0 ) has a relative minimum at x = 2 — yj 4 — 1 0 , and so 
the critical point (2 — ^/4 — X 0 , 0) is a center and is stable. In like manner, one finds that 
the critical point (2 + yj 4 — A 0 , 0) is a saddle point and is unstable. As a particular 
illustration of Case 1, we choose A 0 = 3 < 4. Then the two critical points are (1, 0), the 
center, and (3, 0), the saddle point. A qualitative portrait of the critical points and paths 
in the xy phase plane for this value = 3 is shown in Figure 13.23. 

2. X = 4. As already noted, this is the critical value of the parameter and there is 
only the one critical point (2, 0) of the system (13.74) corresponding to it. With x = 2 
and X 0 = 4, we quickly find that K'(2, 4) = 0 and V"(2 , 4) = 0. However, it is easy to see 
that F'"(x,A 0 ) = —2 for all x and A 0 , and so F"'(2,4) / 0. Thus the potential energy 
has a horizontal inflection point at x = 2, and so the critical point (2,0) is a so-called 
cusp and is unstable. A qualitative portrait of the critical point and paths in the xy 
phase plane is shown in Figure 13.24. 

3. X > 4. For each fixed value X 0 of X such that X 0 > 4, there are no real critical 
points of the system (13.74). Of course, there are still paths in the xy phase plane and a 
qualitative portrait of them for X 0 = 5 > 4 is shown in Figure 13.25. 
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Figure 13.24 


In the general discussion preceding this example, we pointed out that the graph of 
the relation F(x , X) = 0 in the xX plane is often useful in determining both the critical 
values of the parameter X and the critical points of the given nonlinear system (13.71). 
We illustrate this for the particular system (13.74) of this example. From (13.73) we 
again observe that F(x, X) = x 2 — 4x + X in this example and hence that F(x, X) = 0 is 
x 2 — 4x 4- X = 0. Writing this relation as 

X= -x 2 + 4x, (13.77) 

we see at once that its graph in the xX plane is a parabola with vertex at the point (2, 4), 
axis along the line x = 2, and which opens downward (see Figure 13.26). 
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For each fixed value of X, the critical points of system (13.74) are the points in the xy 
phase plane having coordinates (x c , 0), where x c is a root of X = — x 2 + 4x, considered 
as an equation in x. From the graph of X = — x 2 + 4x, we at once observe the following 
results: For each fixed X 0 < 4, there exist two real critical points (x c , 0) of the system 
(13.74). The abscissas x c of these two points are the abscissas of the two points in the xX 
plane where the horizontal line X = X 0 intersects the parabola X = — x 2 + 4x. For 
X 0 = 4, there exists only one real critical point (x c , 0) of the system (13.74). The abscissa 
x c of this point is the abscissa 2 of the single point where the horizontal line X 0 = 4 is 
tangent to the parabola X = — x 2 + 4x at its vertex. Finally, for X 0 > 4, there exist no 
real critical points (x c , 0) of the system (13.74), since the horizontal line X = X 0 lies 
above the vertex of the parabola X = — x 2 + 4x and so never intersects the parabola. 
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C. Liapunov's Direct Method 


In Section 13.3B we observed the following concerning an equilibrium point of a 
conservative dynamical system: If the potential energy has a relative minimum at the 
equilibrium point, then the equilibrium point is stable; otherwise, it is unstable. This 
principle was generalized by the Russian mathematician Liapunov to obtain a method 
for studying the stability of more general autonomous systems. The procedure is 
known as Liapunov's direct (or second) method. 

Consider the nonlinear autonomous system 


dx 

dv 


(13.4) 


Assume that this system has an isolated critical point at the origin (0, 0) and that P and 
Q have continuous first partial derivatives for all (x, y). 


DEFINITIONS 

Let £(x, y) have continuous first partial derivatives at all points (x, y) in a domain D 
containing the origin (0, 0). 

1. The function E is called positive definite in D if E( 0, 0) = 0 and £(x, y) > 0 for all 
other points (x, y) in D. 

2. The function E is called positive semidefinite in D if E( 0, 0) = 0 and £(x, y) > 0 for 
all other points (x, y) in D. 

3. The function E is called negative definite in D if E( 0, 0) = 0 and £(x, y) < 0 for all 
other points in D. 

4. The function E is called negative semidefinite in D if E( 0, 0) = 0 and £(x, y) < 0 
for all other points (x, y) in D. 


► Example 13.13 

The function E defined by £(x, y) = x 2 + y 2 is positive definite in every domain D 
containing (0, 0). Clearly, £( 0, 0) = 0 and £(x, y) > 0 for all (x, y) / (0, 0). 

The function E defined by £(x, y) = x 2 is positive semidefinite in every domain D 
containing (0, 0). Note that £( 0, 0) = 0, E( 0, y) = 0 for all (0, y) such that y / 0 in D, and 
£(x, y) > 0 for all (x, y) such that x # 0 in D. There are no other points in D, and so we 
see that E( 0, 0) = 0 and £(x, y) > 0 for all other points in D. 

In like manner, we see that the function E defined by £(x, y) = — x 2 — y 2 is negative 
definite in D and that defined by £(x, y) = — x 2 is negative semidefinite in D. 


DEFINITION 

Let £(x, y) have continuous first partial derivatives at all points (x, y) in a domain D 
containing the origin (0, 0). The derivative of £ with respect to the system (13.4) is the 



13.3 CRITICAL POINTS AND PATHS OF NONLINEAR SYSTEMS 683 


function E defined by 


E(x, y) = 


dE(x, y) 

8x 


P{x, y) + 


SE(x, y) 

dy 


Q(x, y)- 


(13.78) 


► Example 13.14 


Consider the system 


dx 

li 


-x + y 2 , 


dy 

dt 


= -y + x 2 . 


(13.79) 


and the function E defined by 

E(x, y) = x 2 + y 2 . (13.80) 

For the system (13.79), P(x, y) = —x + y 2 , Q(x, y) = — y + x 2 ; and for the function E 
defined by (13.80), 


dE{x, y) dE(x, y) 

“ 2x ‘ —y 2r - 

Thus the derivative of E defined by (13.80) with respect to the system (13.79) is given by 
£(x, y) = 2x(-x + y 2 ) + 2y( — y + x 2 ) = -2(x 2 + y 2 ) + 2 (x 2 y + xy 2 ). 

(13.81) 


Now let C be a path of system ( 1 3.4); let x = f(t),y = g(t) be an arbitrary solution of 
(13.4) defining C parametrically; and let E(x, y) have continuous first partial derivatives 
for all (x, y) in a domain containing C. Then £ is a composite function of t along C; and 
using the chain rule, we find that the derivative of E with respect to i along C is 


dEU{t\g(t)-] 

dt 


E x U(t),g(tn^P- 


+ E y U(t), g(t)l 


dgjt) 

dt 


= E x U(i), g(t)-]PU(t), g(t)~] + E y [f(t), g(t)mm, 0(f)] 

= £ [/(f), 0(f)]. (13.82) 

Thus we see that the derivative of £[/(f), 0(f)] with respect to t along the path C is 
equal to the derivative of E with respect to the system (13.4) evaluated at x = f(t), 

y = 0(f)- 


DEFINITION 

Consider the system 

~ = P(x, y), j^ = Q(x,y). (13.4) 


Assume that this system has an isolated critical point at the origin (0, 0) and that P and Q 
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have continuous first partial derivatives for all (x, y). Let E(x , y) be positive definite for all 
(x, y) in a domain D containing the origin and such that the derivative E(x , y) of E with 
respect to the system (13.4) is negative semidefinite for all (x, y) e D. Then E is called a 
Liapunov function for the system (13.4) in D. 

► Example 13.15 

Consider the system 

dx 

~di 

dy 
dt 

and the function E defined by 

£(x, y) = x 2 + y 2 , (13.80) 

introduced in Example 13.14. Obviously the system (13.79) satisfies all the require- 
ments of the immediately preceding definition in every domain containing the critical 
point (0, 0). Also, in Example 1 3. 1 3 we observed that the function E defined by ( 1 3.80) is 
positive definite in every such domain. In Example 13.14, we found the derivative of E 
with respect to the system (13.79) is given by 

£(x, y) = — 2(x 2 + y 2 ) + 2(x 2 y + xy 2 ) (13.81) 

for all (x, y). If this is negative semidefinite for all (x, y) in some domain D containing 
(0, 0), then E defined by (13.80) is a Liapunov function for the system (13.79). 

Clearly £( 0, 0) = 0. Now observe the following: If x < 1 and y ^ 0, then xy 2 < y 2 ; 
if y < 1 and x ^ 0, then x 2 y < x 2 . Thus if x < 1, y < 1, and (x, y) ^ (0, 0), x 2 y + 
xy 2 < x 2 + y 2 and hence 

-(x 2 + y 2 ) + (x 2 y + xy 2 ) < 0. 

Thus in every domain D containing (0, 0) and such that x < 1 and y < 1, £(x, y) given 
by (13.81) is negative definite and hence negative semidefinite. Thus E defined by (13.80) 
is a Liapunov function for the system (13.79). 

We now state and prove two theorems on the stability of the critical point (0, 0) of 
system (13.4). 



THEOREM 13.10 

Consider the system 


d i=® x ’ y) - 


(13.4) 


Assume that this system has an isolated critical point at the origin (0, 0) and that P and Q 
have continuous first partial derivatives for all (x, y). If there exists a Liapunov function 
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E for the system {13.4) in some domain D containing (0, 0), then the critical point (0, 0) of 
{13.4) is stable. 

Proof. Let K € be a circle of radius e > 0 with center at the critical point (0, 0), where 
6 > 0 is small enough so that this circle K € lies entirely in the domain D (see 
Figure 13.27). From a theorem of real analysis, we know that a real-valued function 
which is continuous on a closed bounded set assumes both a maximum value and a 
minimum value on that set. Since the circle K € is a closed bounded set in the plane and E 
is continuous in D and hence on K €9 the real analysis theorem referred to in the 
preceding sentence applies to E on K € and so, in particular, E assumes a minimum value 
on K € . Further, since E is also positive definite in D (why?), this minimum value must be 
positive. Thus E assumes a positive minimum m on the circle K € . Next observe that 
since E is continuous at (0, 0) and E{ 0, 0) = 0, there exists a positive number 6 satisfying 
3 < 6 such that E{x , y) < m for all (x, y) within or on the circle K d of radius 3 and center 
at (0, 0). (Again see Figure 13.27.) 

Now let C be any path of (13.4); let x = f{t\ y = g{t) be an arbitrary solution of 
(13.4) defining C parametrically; and suppose C defined by [f{t) 9 g{t)~\ is at a point 
within the “inner” circle K d at t = t 0 . Then 

£[/(*o)> 0(<o)] < m - 

Since E is negative semidefinite in D (why?), using (13.82), we have 

dEUVuw 

dt ~ 

for [/(t), g(t)] e D. Thus £[/(f), g(0] is a nonincreasing function of t along C. Hence 

£[/(*)> 0(03 ^ Elf (to), 0(fo)] < m 



Figure 13.27 
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for all t > t 0 . Since £[/(£), git)'] would have to be >m on the “cuter” circle K €9 we see 
that the path C defined by x = /(£), y = g(t) must remain within K € for all t > t 0 . Thus, 
from the definition of stability of the critical point (0, 0), we see that the critical point 
(0, 0) of (13.4) is stable. Q.E.D. 


THEOREM 13.11 


Consider the system 


f- p 


(13.4) 


Assume that this system has an isolated critical point at the origin (0, 0) and that P and Q 
have continuous first partial derivatives for all (x, y). If there exists a Liapunov function 
E for the system (13.4) in some domain D containing (0,0) such that E also has the 
property that E defined by (13.78) is negative definite in D , then the critical point (0, 0) of 
(13.4) is asymptotically stable. 


Proof. As in the proof of the previous theorem, let K € be a circle of radius € > 0 
with center at the critical point (0, 0) and lying entirely in D. Also, let C be any path of 
(13.4); let x = /(£), y = g(t) be an arbitrary solution of (13.4) defining C parametrically; 
and suppose C defined by [/(£), g(t)] is at a point within K c at t = t 0 (see Figure 13.28). 



Figure 13.28 
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Now since £ is negative definite in D. using (13.82), we have 

dE M <0 

dt 


for [/(t), g(t)~] e D. Thus £[/(t), g(t)~] is a strictly decreasing function of t along C. 
Since E is positive definite in D, E[/(t), g(t)] > 0 for [/(t), g(t)] e D. Thus 
lim E[f(t), 0(f)] exists and is some number L > 0. We shall show that L = 0. 

t-*oo 

On the contrary, assume that L > 0. Since E is positive definite, there exists a positive 
number y satisfying y < e such that £(x, y) < L for all (x, y) within the circle K y of 
radius y and center at (0, 0) (again see Figure 13.28). Now, we can apply the same real 
analysis theorem on maximum and minimum values that we used in the proof of the 
preceding theorem to the continuous function E on the closed region R between and on 
the two circles K € and K y . Doing so, since E is negative definite in D and hence in this 
region R which does not include (0, 0), we see that E assumes a negative maximum — k 
on R. Since £[/(£), g(t)~\ is a strictly decreasing function of t along C and 

lim £[/(!), flf(t)] = L, 

t~* co 


the path C defined by x = f(t), y = g(t) cannot enter the domain within K y for any 
t > t 0 and so remains in R for all t > t 0 . Thus we have £[/(t), g(t)] < — k for all 
t> t 0 . Then by (13.82) we have 


dEimgm 

dt 


= E[f(t),g(t)l< 


-k 


(13.83) 


for all t>t 0 . Now consider the identity 

£[/(f), 3(1)] - £[/(to). 9(t 0 )l = £ dEUi ^ m dt. 
Then (13.83) gives 

£[/(!), g(t)] - E[/(t 0 ), 0(«o)] ^ - | ' kdt 

J to 

and hence 

£[/(!), 3(1)] ^ £[/(f 0 )> 3(lo)] - k(t - t 0 ) 
for all t > t 0 . Now let f -» oo. Since —k< 0, this gives 


(13.84) 


lim £[/(!), g(t)1 = -co. 


But this contradicts the hypothesis that £ is positive definite in D and the assumption 
that 

lim £[/(!), 3(0] = £ > 0. 

t 00 

Thus L = 0; that is, 

lim £[/(!), gf(l)] = 0. 

t~* OO 

Since £ is positive definite in D, E(x, y) = 0 if and only if (x, y) = (0, 0). Thus, 

lim £[/(!), 0(1)] = 0 
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if and only if 

lim f(t) = 0 and lim g(t) = 0. 

t-> oo t->ao 

But, from the definition of asymptotic stability of the critical point (0,0), we see that the 
critical point (0,0) of (13.4) is asymptotically stable. 

► Example 13.16 

Consider the system 

dx 
It 

dy 
dt 

and the function E defined by 

E(x 9 y) = x 2 + y 2 9 (13.80) 

previously studied in Examples 13.14 and 13.15. Before that, in Example 13.13, we 
noticed that the function E defined by (13.80) is positive definite in every domain 
containing (0, 0). In Example 13.14, we found the derivative of E with respect to the 
system (13.79) is given by 

E(x 9 y) = -2(x 2 + y 2 ) + 2 (x 2 y + xy 2 ). (13.81) 

Then, in Example 13.15, we found that E defined by (13.81) is negative semidefinite in 
every domain containing (0,0) and hence that E defined by (13.80) is a Liapunov 
function for the system (13.79) in every such domain. Now, applying Theorem 13.10, we 
see that the critical point (0, 0) of (13.79) is stable. 

However, in Example 13.15 we actually showed that E defined by (13.81) is negative 
definite in every domain D containing (0, 0). Thus by Theorem 13.11, we see that the 
critical point (0, 0) of (13.79) is asymptotically stable. 

Note that the asymptotic stability of the critical point (0, 0) of system (13.79) could 
also be determined by applying Theorem 13.8. 

Liapunov’s direct method is indeed “direct” in the sense that it does not require any 
previous knowledge about the solutions of the system (13.4) or the type of its critical 
point (0, 0). Instead, if one can construct a Liapunov function for (13.4), then one can 
“directly” obtain information about the stability of the critical point (0, 0). However, 
there is no general method for constructing a Liapunov function, although methods for 
doing so are available for certain classes of equations. 


-x + y 2 , 


= —y + x 2 , 


(13.79) 


Exercises 

Determine the type and stability of the critical point (0, 0) of each of the nonlinear 
autonomous systems in Exercises 1-6. 

, dx 9 dy . 9 

1. — = x + x 2 - 3 xy, — = — 2x + y + 3 y 2 . 

dt dt 
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2. ^ = x + 4 y + 3x 4 , 
at 

. dx , 

3 . _ = x + y + xy> 

dx 

4. — = y + tan x, 


dy 


dt 

dy 

dt 


= 2x — 3 y — y 2 + 2x 3 . 
= 3 x — y + 2xy 3 , 


dy 

dt 


= y cos x. 


, dx , ., 2 

5. — = (y+ \y-cosx. 


dy 

dt 


= sin(x + y). 


dx i dy 

6. — = x — 2 y + e xy — cos 2 x, — = sin(x + y). 
dt dt 


7. Consider the autonomous system 


dx 

It 


= ye\ 


g-e*-!. 

dt 


(a) Determine the type of the critical point (0, 0). 

(b) Obtain the differential equation of the paths and find its general solution. 

(c) Carefully plot several of the paths found in step (b) to obtain a phase plane 
diagram which clearly exhibits the nature of the critical point (0, 0). 

(d) Obtain the differential equation of the paths of the corresponding reduced 
linear system and find its general solution. 

(e) Proceed as in step (c) for the paths found in step (d). 

(f) Compare the results of the two phase plane diagrams constructed in steps (c) 
and (e). Comment. 

8. Consider the autonomous system 

lOx + x 2 , 

20 y + x 3 , 

and proceed as in Exercise 7. 

9. The differential equation 

d 2 x 

r = x 5 — 5x 4 + 5x 3 + 5x 2 — 6x 
dt 2 


dx 

dt 

dy 

dt 


d 2 x 

is of the form = F(x) and may be considered as the differential equation of a 


nonlinear conservative dynamical system. 


(a) Obtain the equivalent nonlinear autonomous system and find its critical 
points. 
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(b) Obtain the differential equation of the paths in the xy phase plane and show 
that its general solutions is \y 2 + V(x) = h, where 


V(x) = 


F(x) dx. 


o 


(c) Carefully construct the graph of Y = V(x) along with the lines Y = h for 
several well-chosen values of h in an auxiliary xY plane. Then use this 
auxiliary diagram to sketch the paths \y 2 + V(x) = h in the xy phase plane. 

(d) Use the phase plane diagram constructed in part (c) to determine the type of 
each of the critical points found in step (a). Also use the diagram to draw 
conclusions concerning the motion described by the second-order nonlinear 
equation of the problem. 


10. Proceed as in Exercise 9 for the differential equation 


d 2 x 
dt 2 


= sin x (cos x — 1). 


1 1. Proceed as in Exercise 9 for the differential equation 


d 2 x 

IS 


= x 5 — 3x 4 + x 3 + 3x 2 - 2x. 


12. Proceed as in Exercise 9 for the differential equation 

d 2 x 

-^2 = sm x (2 cos * — !)• 


13. Consider the nonlinear system 


dx 

It 


= 6x — y + x 2 


^ = ocx + 2y + y 2 , 


which depends on a parameter a. Assuming that a # — 12, determine the type of 
the critical point (0, 0) of the system as a function of a. 

14. For the system 


dy 

dt 


3x + y + 2x 2 — x 3 , 


(a) find all critical points; 

(b) determine the type of each critical point (a, /?) for which a > 0. 

15. Find the real critical points of each of the following systems, and determine the 
type and stability of each critical point found. 

, . dx A dx 9 

(a) — = x + 4y, — = x + y - x 2 . 



13.3 CRITICAL POINTS AND PATHS OF NONLINEAR SYSTEMS 691 


» dx ? dy 

(b) — = x — y + x 2 , — = 12x — 6 y + xy. 

dt dt 

16. Find the real critical points of each of the following systems, and determine the 
type and stability of each critical point found. 


(a) 


dx 

dt 

dx 


= y — x 


dy 

dt 


= 8x — y 2 . 


(b) ^ = 2x - y + x 2 , 
dt 


dy 

dt 


= -12x + y + y 2 . 


17. Consider the differential equation 

d 2 x 

It 2 


= x 2 — 6x -f- 2, 


where X is a real parameter independent of x. 

(a) Find the critical value X 0 of the parameter X. 

(b) Let X = 5. Find the critical points of the related autonomous system, make a 
qualitative sketch of the paths in the xy phase plane, and determine the type 
of each critical point. 

(c) Let X = X 0 [the critical value determined in part (a)]. Proceed as in part (b). 

18. Consider the differential equation 

d 2 x 2 _ 

77 = x 2 - Xx + 9, 
dt * 


where X is a real parameter independent of x. 

(a) Find the critical values of the parameter X. 

(b) Let X = 10. Find the critical points of the related autonomous system, make 
a qualitative sketch of the paths in the xy phase plane, and determine the 
type of each critical point. 

(c) Let X = X 0 [the positive critical value determined in part (a)]. Proceed as in 
part (b). 

19. For each of the following systems, construct a Liapunov function of the form 
Ax 2 + By 2 , where A and B are constants, and use the function to determine 
whether the critical point (0, 0) of the system is asymptotically stable or at least 
stable. 


(a) 

(b) 

(c) 

(d) 


II 

1^3 

—x + lx 2 

+ y 2 . 

1 

II 

1 w 

y 

+ xy. 

dx 

-x 4- y - 


, dy 



~dt = 

X ~ Xy ’ Jt 


-x-y- 

dx 



dy 


X 

di = 

-x-y- 

X 3 , 

— = x — 
dt 

y 

-y- 

dx 

dt 

— 3x + x 3 

+ 2 xy 2 

II 

- 

■2 y + h 3 . 


_ .,3 
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13.4 LIMIT CYCLES AND PERIODIC SOLUTIONS 


A. Limit Cycles 


We have already encountered autonomous systems having closed paths. For example, 
the system (1 3.67) of Section 1 3.3B has a center at (0, 0) and in the neighborhood of this 
center there is an infinite family of closed paths resembling ellipses (see Figure 1 3.21). In 
this example the closed paths about (0, 0) form a continuous family in the sense that 
arbitrarily near to any one of the closed paths of this family there is always another 
closed path of the family. Now we shall consider systems having closed paths which are 
isolated in the sense that there are no other closed paths of the system arbitrarily near 
to a given closed path of the system. 

What is the significance of a closed path? Given an autonomous system 


t - Qix • ^ 


(13.4) 


one is often most interested in determining the existence of periodic solutions of this 
system. It is easy to see that periodic solutions and closed paths of ( 1 3.4) are very closely 
related. For, in the first place, if x = fft), y = gft), where and g l are not both 
constant functions, is a periodic solution of (13.4), then the path which this solution 
defines is a closed path. On the other hand, let C be a closed path of (13.4) defined by a 
solution x = f(t), y = g(t) of (13.4), and suppose f(t 0 ) = x 0 , g(t 0 ) = y 0 . Since C is 
closed, there exists a value t t = t 0 + T, where T > 0, such that f(t x ) = x 0 , g{t x ) = y 0 . 
Now the pair x = f(t + T), y = g(t + T) is also a solution of (13.4). At t = t 0 , this 
latter solution also assumes the values x = x 0 , y = y 0 . Thus by Chapter 10, Sec- 
tion 10.4A, Theorem 10.5, the two solutions x = f(t), y = g{t) and x = f(t + T), y = 
g(t + T) are identical for all t. In other words, f(t + T) = / (t), g(t + T) = g(t) for all t , 
and so the solution x = f(t ), y = g(t) defining the closed path C is a periodic solution. 
Thus, the search for periodic solutions of ( 1 3.4) falls back on the search for closed paths. 

Now suppose the system (13.4) has a closed path C. Further, suppose (13.4) possesses 
a nonclosed path C 1 defined by a solution x = f(t ), y = g(t) of (13.4) and having the 
following property: As a point R traces out C t according to the equations x = f(t ), 
y = g(t), the path C 1 spirals and the distance between R and the nearest point on the 
closed path C approaches zero either as t -♦ + oo or as t -► — oo. In other words, the 
nonclosed path C i spirals closer and closer around the closed path C either from the 
inside of C or from the outside either as t -+ + oo or as t -► — oo (see Figure 13.29 where 
C x approaches C from the outside). 

In such a case we call the closed path C a limit cycle , according to the following 
definition: 


DEFINITION 

A closed path C of the system (13.4) which is approached spirally from either the inside or 
the outside by a nonclosed path C i of (13.4) either ast^> + oo or as t -► — oo is called a 
limit cycle of (13.4). 
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► Example 13.17 


The following well-known example of a system having a limit cycle will illustrate the 
above discussion and definition. 


= y + x(l - x 2 - y 2 ), 
f t = - X + y(l-x 2 -y 2 ). 


(13.85) 


To study this system we shall introduce polar coordinates (r, 6 ), where 

x = r cos 6 , 
y = r sin 6. 

From these relations we find that 


dx 

Tt 




dt 





dO 

Tt ’ 


(13.86) 


(13.87) 


Now, multiplying the first equation of (13.85) by x and the second by y and adding, we 
obtain 
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Introducing the polar coordinates defined by (13.86), and making use of (13.87), this 
becomes 


4 = ^(i-^). 


For r # 0, we may thus write 


dr 

dt 


= r(l — r 2 ). 


Now multiplying the first equation of (13.85) by y and the second by x and subtracting, 
we obtain 


dx dy 1 1 

n t - x Jr y + *' 


Again using (13.87), this becomes 


and so for r # 0 we may write 


2 d6 2 

— r — = r 

dt ’ 


£—1. 

dt 


Thus in polar coordinates the system (13.85) becomes 

i-*-* 


dO 

Tt 


(13.88) 


= - 1 . 


From the second of these equations we find at once that 

0 = —t + to, 

where t 0 is an arbitrary constant. The first of the equations (13.88) is separable. 
Separating variables, we have 


dr 


= dt , 


r(l — r 2 ) 

and an integration using partial fractions yields 

In r 2 — ln| 1 — r 2 \ = It + ln|c 0 |. 
After some manipulations we obtain 


y2 _ C 0 e 


2 1 


2 1 * 


Thus we may write 


r = 


1 + c 0 e 


— 7 = = where c = — . 

yi + C<r 2 ' c 0 
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Thus, the solution of the system (13.88) may be written 

1 

r ~Vl +£*-*’ 
e = -t + 1 0 , 

where c and t 0 are arbitrary constants. We may choose t 0 = 0. Then 6 = — 1\ using 
(13.86), the solution of the system (13.85) becomes 

cos t 

x = — =====, 
y/\ + ce 21 

_ sin t 
./I + ce~ 2t 

The solutions (13.89) of (13.85) define the paths of (13.85) in the xy plane 
these paths for various values of c, we note the following conclusions: 

1. If c = 0, the path defined by (13.89) is the circle x 2 + y 2 = 1, described in the 
clockwise direction. 

2. If c # 0, the paths defined by (13.89) are not closed paths but rather paths having a 
spiral behavior. If c > 0, the paths are spirals lying inside the circle x 2 + y 2 = 1. As 
t + oo, they approach this circle; while as t -+ — oo, they approach the critical 
point (0, 0) of (13.85). If c < 0, the paths lie outside the circle x 2 + y 2 = 1. These 
“outer” paths also approach this circle as t -* + oo; while as t -► \nj] 7], both |x| 
and | y| becomes infinite. 

Since the closed path x 2 + y 2 = 1 is approached spirally from both the inside and 
the outside by nonclosed paths as t -+ + oo, we conclude that this circle is a limit cycle 
of the system (13.85). (See Figure 13.30). 


(13.89) 

. Examining 


B. Existence and Nonexistence of Limit cycles 

In Example 13.17 the existence of a limit cycle was ascertained by actually finding this 
limit cycle. In general such a procedure is, of course, impossible. Given the autonomous 
system (13.4) we need a theorem giving sufficient conditions for the existence of a limit 
cycle of (13.4). One of the few general theorems of this nature is the Poincare- 
Bendixson theorem, which we shall state below (Theorem 13.13). First, however, we 
shall state and prove a theorem on the nonexistence of closed paths of the system (13.4) 


THEOREM 13.12 Bendixson's Nonexistence Criterion 


Hypothesis. Let D be a domain in the xy plane. Consider the autonomous system 

j,- mx ’ y) ’ 


(13.4) 
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Figure 13.30 


where P and Q have continuous first partial derivatives in D. Suppose that 
dP(x , y) dQ(x , y) 

— 1 has the same sign throughout D. 

ox dy 

Conclusion. The system ( 13.4 ) has no closed path in the domain D. 

Proof. Let C be a closed curve in D; let R be the region bounded by C; and apply 
Green’s theorem in the plane. We have 


[P(x, y) dy - Q(x, y ) dx] = 


dP(x, y) dQ(x, y) 
dx dy 


where the line integral is taken in the positive sense. Now assume that C is a closed 
path of (13.4); let x = f(t\ y = g(t) be an arbitrary solution of (13.4) defining C 
parametrically; and let T denote the period of this solution. Then 

d ^ = P[f(t),g( m, 

— = £[/( t),g(t)l 


along C and we have 


[P(x, y) dy - Q(x, y) dx ] 


P[/(fU(0]^-e[/(0,^)]® 
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Thus 


= £ {pm), g(t)mm m ] - Qim, gwim, g m a% 
= o. 


M 


SP(x, y) + dQ(x, y) 


dx 


dy 


ds = 0. 


But this double integral can be zero only if ^ + dQ(*> y) c | ian g es s jg n# This j s a 

ox dy 

contradiction. Thus C is not a path of (13.4) and hence (13.4) possesses no closed path 
in D. Q.E.D . 


► Example 13. IB 


dx _ . 

* =2 * + > , + *- 


dt 


= 3x - y + }; 3 . 


(13.90) 


Here 

P{x, y) = 2x + y + x 3 , 
Q(x, y) = 3x- y + y 3 , 
and 


SP(x, y) 

dx 


dQ(x>y) 

dy 


= 3(x 2 + y 2 ) + 1. 


Since this expression is positive throughout every domain D in the xy plane, the system 
( 1 3.90) has no closed path in any such domain. In particular, then, the system ( 1 3.90) has 
no limit cycles and hence no periodic solutions. 


Having considered this nonexistence result, we now turn to the Poincare-Bendixson 
existence theorem, the proof of which is outside the scope of this book. We shall merely 
state the theorem and indicate its significance. In order to make the statement of the 
theorem less involved, we introduce two further concepts. 


DEFINITION 

Let C be a path of the system (13.4) and let x = f(t), y = g(t) be a solution of (13.4) 
defining C. Then we shall call the set of all points of C for t >t 0 , where t 0 is some value of 
t , a half-path of (13.4). In other words , by a half-path of (13.4) we mean the set of all 
points with coordinates [/(t), g(t)~\ for t 0 < t < + oo .We denote a half-path of (13.4) by 
C + . 
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DEFINITION 

Let C + be a half-path of (13.4) defined byx = f(t\y = g(0 for t>t 0 . Let (x 1? yj bea 
point in the xy plane. If there exists a sequence of real numbers {t n }, n = 1,2,..., such 
thatt n -+ + 00 aw/ [/(£„), #(£„)] -+(x 1 , y t )asn -* + 00 , then we call (x t , y t ) a limit point 
of C + . The set of all limit points of a half-path C + will be called the limit set 0 / C + and 
wi/Z be denoted by L(C + ). 


► Example 13.19 


The paths of the system (13.85) are given by Equations (13.89). Letting c = 1 we obtain 
the path C defined by 


cos t 


x = 


yr+7^ 

sin t 


(13.91) 


y= — 


yi+e- 2 ' 

The set of all points of C for t > 0 is a half-path C + . That is, C + is the set of all points 
with coordinates 


cos t sin t 

.y/l + e~ 2t 9 + e~ 2t _ ’ 


0 < t < + 00 . 


Consider the sequence 0, 2n, 2nn,..., tending to +00 as n 

corresponding sequence of points on C + is 


+ 00 . The 


cos 2nn sin 2 nn 

_y/l +e“ 4 "’ t ’ y/i Te^J 


(n = 0,1,2,...), 


and this sequence approaches the point (1, 0) as n -> + 00 . 

Thus (1, 0) is a limit point of the half-path C + . 

The set of all limit points of C + is the set of points such that x 2 + y 2 = 1. In other 
words, the circle x 2 + y 2 = 1 is the limit set of C + . 

We are now in a position to state the Poincare-Bendixson theorem. 


THEOREM 13.13 Poincare-Bendixson Theorem; "Strong" Form 


Hypothesis 

1. Consider the autonomous system 

dx 

~dt 


P(x> y)> 


dy 

dt 


Q(* , y\ 


(13.4) 


where P and Q have continuous first partial derivatives in a domain D of the xy plane. Let 
D l be a bounded subdomain of D, and let R denote D x plus its boundary. 
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2. Let C + defined by x = f(t), y = g(t) 9 t > t 0 , be a half-path of (13.4) contained 
entirely in R. Suppose the limit set L(C + ) of C + contains no critical points of (13.4). 

Conclusion. Either (1) the half-path C + is itself a closed path [in this case C + and 
L(C + ) are identical ], or (2) L(C + ) is a closed path which C + approaches spirally from 
either the inside or the outside [in this case L(C + ) is a limit cycle\ Thus in either case , 
there exists a closed path of (13.4) in R. 

A slightly weaker but somewhat more practical form of this theorem may be seen at 
once. If the region R of Hypothesis 1 contains no critical points of (13.4), then the limit 
set L(C + ) will contain no critical points of (13.4) and so the second statement of 
Hypothesis 2 will automatically be satisfied. Thus we may state: 


THEOREM 13.13A Poincare-Bendixson Theorem; "Weak" Form 
Hypothesis 

1. Exactly as in Theorem 13.13. 

2. Suppose R contains no critical points of (13.4). 

Conclusion. If R contains a half-path of (13.4), then R also contains a closed path of 
(13.4). 

Let us indicate how this theorem may be applied to determine the existence of a 
closed path of (13.4). Suppose the continuity requirements concerning the derivatives 
of P(x, y) and Q(x , y) are satisfied for all (x, y). Further suppose that (13.4) has a critical 
point at (x 0 , y 0 ) but no other critical points within some circle 

K:(x- x 0 ) 2 + (y - y 0 ) 2 = r 2 

about (x 0 , y 0 ) (see Figure 13.31). Then an annular region whose boundary consists of 
two smaller circles K t :(x - x 0 ) 2 + (y - y 0 ) 2 = r? and K 2 :(x - x 0 ) 2 + (y — y 0 ) 2 = 
r 2 , where 0 < r t < r 2 < r, about (x 0 , y 0 ) may be taken as a region R containing no 
critical points of (13.4). If we can then show that a half-path C + of (13.4) (for t > some 
t 0 ) is entirely contained in this annular region R , then we can conclude at once that a 
closed path C 0 of (13.4) is also contained in R. 

The difficulty in applying Theorem 13.13A usually comes in being able to show that 
a half-path C + is entirely contained in R. If one can show that the vector 
[P(x, y), Q(x , y)] determined by (13.4) points into Pat every point of the boundary of R , 
then a path C entering R at t = t 0 will remain in R for t > t 0 and hence provide the 
needed half-path C + . 


► Example 13.20 

Consider again the system (13.85) with critical point (0,0). The annular region R 
bounded by x 2 + y 2 = * and x 2 + y 2 = 4 contains no critical points of (13.85). If we 
can show that R contains a half-path of (13.85), the Poincare-Bendixson theorem 
(“weak” form) will apply. 



X 


Figure 13.31 


In our previous study of this system we found that 

fo r r > 0, w here r = ^/x 2 + y 2 . On the circle x 2 + y 2 = dr/dt > 0 and hence r = 
x /* 2 +y 2 is increasing. Thus the vector [P(x, y), Q(x , y)] points into R at every poi nt 
of this inner circle. On the circle x 2 + y 2 = 4, dr/dt < 0 and hence r = ^Jx 2 + y 2 is 
decreasing. Thus the vector [P(x, y), Q(x 9 y)] also points into R at every point of this 
outer circle. (See Figure 1 3.32, in which this situation is illustrated qualitatively.) Hence 
a path C entering jR at t = t 0 will remain in R for t > t 0 , and this provides us with the 
needed half-path contained in R. 

Thus by the Poincare-Bendixson theorem (“weak” form), we know that R contains a 
closed path C 0 . We have already seen that the circle x 2 + y 2 = 1 is indeed such a closed 
path of (13.85). 


C. The Index of a Critical Point 

We again consider the system (13.4), where P and Q have continuous first partial 
derivatives for all (x, y), and assume that all of the critical points of (13.4) are isolated. 
Now consider a simple closed curve* C [not necessarily a path of (13.4)] which passes 


* By a simple closed curve we mean a closed curve having no double points; for example, a circle is a 
simple closed curve, but a figure eight is not. 
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Figure 13.32 





>- 

X 


Figure 13.33 


through no critical points of (13.4). Consider a point (x 1? j/J on C and the vector 
[P(x 1? y t ), Q(x 1? yj] defined by (13.4) at the point (x 1? y t ). Let 6 denote the angle from 
the positive x direction to this vector (see Figure 13.33). 

Now let (x 1? describe the curve C once in the counterclockwise direction and 
return to its original position. As (x r , y x ) describes C, the vector [P(x 1? yj, 6(x 1? yj] 
changes continuously, and consequently the angle 6 also varies continuously. When 
(xi, y t ) reaches its original position, the angle 6 will have changed by an amount A 6. 
We are now prepared to define the index of the curve C. 
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DEFINITION 

Let 6 denote the angle from the positive x direction to the vector [ P(x l9 yj, Q(x Vi y J] 
defined by (13.4) at (x 1? y t ). Let A 6 denote the total change in 6 as (x b yj describes the 
simple closed curve C once in the counterclockwise direction. We call the number 


the index of the curve C with respect to the system (13.4). 

Clearly A 6 is either equal to zero or a positive or negative integral multiple of 2n and 
hence / is either zero or a positive or negative integer. If [ P(x l9 y t ) 9 Q(x u yj] merely 
oscillates but does not make a complete rotation as (x t , yj describes C, then I is zero. If 
the net change A 6 in 6 is a decrease, then / is negative. 

Now let (x 0 , y 0 ) be an isolated critical point of (13.4). It can be shown that all simple 
closed curves enclosing (x 0 , y 0 ) but containing no other critical point of (13.4) have the 
same index. This leads us to make the following definition. 


DEFINITION 

By the index of an isolated critical point (x 0 , y 0 ) of (13.4) we mean the index of a simple 
closed curve C which encloses (x 0 , y 0 ) but no other critical points of (13.4). 

From an examination of Figure 13.34 we may reach the following conclusion 
intuitively: The index of a node, a center, or a spiral point is + 1, while the index of a 
saddle point is — 1. 

We now list several interesting results concerning the index of a simple closed curve 
C and then point out several important consequences of these results. Most of these 
results are intuitively obvious, but few are easy to prove rigorously. In each case when 
we say index we shall mean the index with respect to the system (13.4) where P(x, y) and 
Q(x , y) have continuous first partial derivatives for all (x, y) and (13.4) has only isolated 
critical points. 

1. The index of a simple closed curve which neither passes through a critical point of 
(13.4) nor has a critical point of (13.4) in its interior is zero. 

2. The index of a simple closed curve which surrounds a finite number of critical 
points of (13.4) is equal to the sum of the indices of these critical points. 

3. The index of a closed path of (13.4) is + 1. 

From these results the following conclusions follow as once. 

(a) A closed path of (13.4) contains at least one critical point of (13.4) in its interior 
[for otherwise, by (1), the index of such a closed path would be zero; and this would 
contradict (3)]. 

(b) A closed path of (13.4) may contain in its interior a finite number of critical 
points of (13.4), the sum of the indices of which is + 1 [this follows at once from (2) and 

( 3 )]. 
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Nodes 





Center 



Spiral Point 



Saddle 


Figure 13.34 


D. The Lienard-Levinson-Smith Theorem and the van der Pol Equation 

Throughout this chapter we have stated a number of important results without proof. 
We have done this because we feel that every serious student of differential equations 
should become aware of these results as soon as possible, even though their proofs 
are definitely beyond our scope and properly belong to a more advanced study of our 
subject. In keeping with this philosophy, we close this section by stating without proof 
an important theorem dealing with the existence of periodic solutions for a class of 
second-order nonlinear equations. We shall then apply this theorem to the famous van 
der Pol equation already introduced at the beginning of the chapter. 


THEOREM 1 3.1 4 Lienard-Levinson-Smith 

Hypothesis. Consider the differential equation 

d^x dx 

lt T + f(x) l^ + g(x) = 0 ’ (1192) 

where /, g , F defined by F(x) = f(u) du 9 and G defined by G(x) = J* g(u) du are real 
functions having the following properties : 

1. f is even and is continuous for all x. 

2. There exists a number x 0 > 0 such that F(x) < 0 for 0 < x < x 0 and F(x) > 0 and 
monotonic increasing for x > x 0 . Further , F(x) -> co as x co. 
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3. g is odd , has a continuous derivative for all x, and is such that g(x) > 0 for x > 0. 

4 . G(x) — * oo as x — * oo. 

Conclusion. Equation (13.92) possesses an essentially unique nontrivial periodic 
solution. 


Remark. By “essentially unique” in the above conclusion we mean that if x = 0(t) is 
a nontrivial periodic solution of (13.92), then all other nontrivial periodic solutions of 
(13.92) are of the form x = (f)(t — t t ), where t t is a real number. In other words, the 
equivalent autonomous system 

dx 


dy 

dt 


-f(x)y ~ g(x\ 


has a unique closed path in the xy plane. 

One of the most important examples of an equation of the form (13.92) which 
satisfies the hypotheses of Theorem 13.14 is the van der Pol equation 


d 2 x , , dx 
W + « x -D^ + x.O, 


where p is a positive constant. Here /(x) = p(x 2 


F(x) = 


1), g(x) = x, 

3 


p(u 2 -l)du = p[ — -x 


and 


(13.2) 


G(x) = 


udu = 


We check that the hypotheses of Theorem 13.14 are indeed satisfied: 


1. Since /( — x) = p(x 2 — 1) = /(x), the function / is even. Clearly it is continuous for 
all x. 

2. F(x) = p(x 3 /3 — x) is negative for 0 < x < y/3. For x > ^/3, F(x) is positive and 
monotonic increasing (it is, in fact, monotonic increasing for x > 1). Clearly 
F(x) -► oo as x -► oo. 

3. Sincegf( — x)= — x = —g(x), the function^ is odd. Since g'(x) = 1, the derivative of 
g is continuous for all x. Obviously g(x) > 0 for x > 0. 

4. Obviously G(x) -> oo as x -> oo. 

Thus the conclusion of Theorem 13.14 is valid, and we conclude that the van der Pol 
equation (13.2) has an essentially unique nontrivial periodic solution. In other words, 
the equivalent autonomous system 


dx 

Tt =y ' 

^ = )U(1 - x 2 )y - x, 


(13.93) 


has a unique closed path in the xy plane. 
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Figure 13.35 


The differential equation of the paths of the system (13.93) is 

Ay _ ju(l — x 2 )y — x 
dx y 


(13.94) 


Using the method of isoclines (Section 8. 1 B) one can obtain the paths defined by ( 1 3.94) 
in the xy plane. The results for n = 0.1, ju = 1, and n = 10 are shown in Figure 13.35a, 
13.35b, and 13.35c, respectively. The limit cycle C in each of these figures is the unique 
closed path whose existence we have already ascertained on the basis of Theo- 
rem 13.14. For fi = 0.1 we note that this limit cycle is very nearly a circle of radius 2. 
For ju = 1, it has lost its circle-like form and appears rather “baggy,” while for // = 10, 
it is very long and narrow. 
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Exercises 


1. Consider the nonlinear autonomous system 

dx 


— 1 = 4x — 4y — x(x 2 + y 2 ), 
at 


dy 

dt 


= 4x + 4y — y(x 2 -f y 2 ). 


(a) Introduce polar coordinates (r, 6) and transform the system to polar 
coordinate form. 

(b) Apply the Poincare-Bendixson theorem to show that there exists a limit cycle 
between the circles 

x 2 + y 2 = i and x 2 + y 2 = 16. 

(c) Find the explicit solutions x = /(t), y = g(t) of the original system. In 
particular, find “the” periodic solution corresponding to the limit cycle whose 
existence was ascertained in step (b). 

(d) Sketch the limit cycle and at least one of the nonclosed paths. 


2. Proceed as in Exercise 1 for the system 

x 


dx 

— = y + , 

dt V * 2 +y 2 


dy 

-f = -x +■ 

dt Jx 1 + y 


. [1 - (x 2 + y 2 )l 

y n , ,2\ 


[1 - (x 2 + y 2 )]. 


Consider the nonlinear differential equations in Exercises 3-7. In each case apply one 
of the theorems of this section to determine whether or not the differential equation has 
periodic solutions. 

d 2 x 


3. 

4. 

5. 

6 . 

7. 


dt : 


+ x* 


dx dx 


h X — 0. 


dt dt 


d 2 x dx 
dt 2 + dt + 


dA 3 

Tt) +x - 0 ' 


d 2 x 

Tt 

d 2 x 

dt 

d 2 x 

dt 2 


2 + (x 4 + X 2 ) ^ + (x 3 + x) = 0. 


2 +(5x 4 -6x 2 )^ + x 3 = 0. 


dx 

It 


+ (x 2 + l)^ + x 3 = 0. 


8. Consider the nonlinear autonomous system 

dx 


dt 


= x — x 


dy 0 2 
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(a) Locate, classify (as to type), and list the index of each of the critical points. 

(b) Find the equation of the paths of the system in the xy phase plane and sketch 
several of these paths. 

(c) Determine the index of each of the following simple closed curves with respect 
to the given system. In each case explain how the index was determined. 

(i) 4(x 2 + y 2 ) = 1. 

(ii) 100(x 2 + y 2 ) = 441. 

(iii) x 2 + y 2 = 9. 

(iv) 4(x 2 + y 2 ) — 8(x + 2y) +11= 0. 


13.5 THE METHOD OF KRYLOFF AND BOGOLIUBOFF 

A. The First Approximation of Kryloff and Bogoliuboff 

In this section we consider a method of finding an approximate solution of an equation 
of the form 

d 2 x ~ ( dx\ „ 

-jp- + ( 0 2 x + nf lx, — J = 0, (13.95) 

where ju is a sufficiently small parameter so that the nonlinear term jjf(x,dx/dt) is 
relatively small. The method which we shall discuss is due to the Russian scientists 
Kryloff and Bogoliuboff, and it is basically a method of variation of parameters. 

If fi = 0 in Equation (13.95), then Equation (13.95) reduces to the linear equation 

+ co 2 x = 0. (13.96) 

The solution of Equation (13.96) may be written 

x = a sin (cot -f (/>), (13.97) 

where a and </> are constants. The derivative of the solution (13.97) of (13.96) is 

dx 

— = aco cos (cot + </>). (13.98) 

dt 

If fi # 0 but sufficiently small (that is, | ju | « 1), one might reasonably assume that the 
nonlinear equation (13.95) also has a solution of the form( 13.97) with derivative of the 
form (13.98), provided that a and </> now be regarded as functions of t rather than 
constants. This is precisely what we shall do in applying the Kryloff-Bogoliuboff 
method. That is, we assume a solution of (13.95) of the form 

x = a(t) sin[cot + 0(f)], (13.99) 

where a and 0 are functions of t to be determined, such that the derivative of the 
solution (13.99) is of the form 


— = coa(t) cos [cot + ( p(t )]. 
dt 


(13.100) 
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Differentiating the assumed solution (13.99), we obtain 

= coa(t ) cos [cot + 0(t)] + a (t) -j- cos [cot + 0(0] 


+ — sin[cot + 0(f)]. (13.101) 

at 

In order for dx/dt to be of the desired form (13.100), we see from (13.101) that we must 
have 


a(t) cos [cor + 0(f)] + ^ sin [cor -f 0(r)] = 0. 
dt dt 


(13.102) 


Now differentiating the assumed derivative (13.100), we obtain 

— -r = — co 2 o(r) sin[cor -f 0(r)] — coa(r)^ sin[cor H 
dt 1 dt 


da 

+ co — cos [cor + 0(r)]. 
dt 


(13.103) 


Substituting the assumed solution (13.99), its assumed derivative (13.100), and the 
second derivative given by (13.103) into the differential equation (13.95), we obtain 

— co 2 a(t) sin[cor + 0(r)] - coa(r)^ sin [cot + 0(t)] 


dt 


da 


+ co -j- cos [cor -f 0(r)] + co 2 a(r) sin[cor + 0(r)] 
dt 

+ nf{a(t) sin[cor + 0(r)], coa(r) cos [cor + 0(r)]} = 0 


or 


co ^ cos[c or + 0(r)] — coa(t) sin[cor + 0(r)] 
dt dt 

= -ju/{a(r)sin[cor + 0(r)], coa(r)cos[cor + 0(r)]}. (13.104) 

Let 6(t) denote cor + 0(t). Then Equations (13.102) and (13.104) may be written 

da deb 

sin 6(t) — + a(t) cos 0(t) — = 0 
dt dt 


co cos 6(t) ^ — coa(t) sin 6(t) = ~nf[a(t) sin 0(t), coa(r) cos 0(t)]. 

dt dt 


(13.105) 


Solving the system (13.105) for da/dt and dcj)/dt, we obtain the following equations: 


^ = — — f[a(t) sin 6(t ), coa(r) cos 0(t)] cos 0(t), 

dt CD 




# 

dt coa(t) 


(13.106) 


f[a(t) sin 0(t), coa(t) cos 0(t)] sin 0(t). 


These are the exact equations for the functions a and 0 in the solution of the form 
(13.99) with derivative of the form (13.100). Note that they are nonlinear and 
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nonautonomous and hence are quite complicated, to say the least. In order to make 
these equations tractable, we now apply the first approximation of Kryloff and 
Bogoliuboff. 

From Equations (13.106) we see that da/dt and d(j)/dt are both proportional to the 
small parameter p. Hence although a and are functions of t , they are slowly varying 
functions of t during a period T = 2n/co of t. That is, in a time interval of length 2n/oo, 
a(t) and fi(t) are almost constant, while 9(t) = cot + 0(r) increases by approximately 2 tc. 
Thus in the right members of (13.106) we regard a(t) and 0(t) as constant during an 
interval of 2n in 0 and then replace the functions of the right members of (13.106) by 
their mean values over such an interval in 0 . This leads to the equations 


da 

It 

d(f> 

~di 


2nco 


2ncoa 


i 

r 


f(a sin 0 , coa cos 6) cos 9 d9 , 


f(a sin 9 , coa cos 9) sin 9 d9. 


(13.107) 


These are the equations of the first approximation of Kryloff and Bogoliuboff for the 
functions a and </> of the solution (13.99). We hasten to point out that we have not 
rigorously justified this procedure. Assuming that the procedure is justifiable for the 
problem under consideration, a first approximation to the solution of Equation ( 1 3.95) 
is thus given by 

x = a(t) sin[cut + 0(f)], (13.99) 


where a(t) and </>(r ) are determined by Equations (13.107). 


B. Two Special Cases 


Case 7. The term /(x, dx/dt) depends upon x only. If /(x, dx/dt) in Equation (13.95) 
depends upon x only, then the equation reduces to 


d 2 x 

di 2 


+ co 2 x + pf(x) = 0. 


(13.108) 


Then f(a sin 9 , coa cos 9) becomes simply f(a sin 0), and the Equations (13.107) 
reduce at once to 


da 

di 

# 

dt 


— — I f(a sin 9) cos 9 d9 , 
2nco Jo 

— — — I f(a sin 9) sin 9 d9. 
2ncoa J 0 


(13.109) 


Consider the integral in the right member of the first equation of (13.109). Letting 
u = a sin 0, we have 

f 2 * 1 f° 

f(a sin 0) cos 0 d9 = - f(u) du = 0. 

Jo a Jo 

Thus from the first equation of ( 1 3. 1 09) we see that da/dt = 0, and so the amplitude a(t) 
is a constant a 0 . The second equation of (13.109) now reduces to 
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where 

F(a 0 ) = - - I f(a 0 sin 6) sin 6 d6. (13.111) 

2ncoa 0 J 0 

Thus from (13.1 10), </>(t ) = F(a 0 )t + </> 0 , where </> 0 is a constant of integration. 

Hence the first approximation to the solution of (13.108) is given by 

x = a 0 sin{[F(a 0 ) + co]t + </> 0 }, (13.112) 

where the constant F(a 0 ) is given by (13.1 1 1). 

Therefore in the special case (13.108) in which the nonlinear term /(x) depends upon 
x only (and not on dx/dt), the first approximation to a solution is a periodic oscillation 
the period of which depends upon the amplitude. 


Case 2. The term /(x, dx/dt) depends upon dx/dt only . In this case Equation (13.95) 
reduces to 


d 2 x 2 
-^- + <0 x + lif 


= 0 . 


(13.113) 


Thus f(a sin 6 , coa cos 0) becomes simply f(coa cos 6 ), and Equations (13.107) now 
reduce to 


da 

~dt 


F 

2nco 


*2n 

f(coa cos 6) cos 6 d6, 
o 


d$ 

dt 


- — f / (coa cos 6) sin 6 d6. 
2na>a J 0 


(13.114) 


Letting u = coa cos 6 in the integral in the right member of the second equation of 
(13.114), we have 


"2 n 

Jo 


f(a)a cos 6) sin 6 d6 


=--r 

<»a J m 


f (u) du = 0. 


Thus from the second equation of (13.114) we see that d(f)/dt = 0, and so (f>(t ) is a 
constant 0 O . 

Hence the first approximation to the solution of (13.113) is given by 


x = a(t) sin[cot + 0 O ], (13.115) 

where a(t) is given by the first equation of (13.1 14). 

Therefore in the special case (13.1 13) in which the nonlinear term f (dx/dt) depends 
upon dx/dt only (and not on x), the first approximation to a solution is a nonperiodic 
oscillation of variable “amplitude” having the same frequency (oo/2n) as that of 
solution (13.97) of the related linear equation (13.96). 


C. Examples 

We now apply the equations of the first approximation of Kryloff and Bogoliuboff to 
several differential equations of the form (13.95). 



13.5 THE METHOD OF KRYLOFF AND BOGOLIUBOFF 711 


► Example 13.21 


Consider the differential equation 

d 2 x 

dt 2 


+ co 2 x + fix 3 = 0, 


(13.116) 


where /x is a small parameter. Here f(x,dx/dt) = /(x) = x 3 , and hence Equa- 
tion (13.116) is of the special form (13.108) in which the nonlinear term depends 
upon x only. Thus the first approximation to the solution of (13.116) is given by 
(13.112), where F(a 0 ) is determined by (13.111). Since f(a 0 sin 9) = c*o sin 3 9 , we see 
from (13.111) that 

*2n 


F(a 0 ) = 


Incocin 


al sin 3 9 sin 6 d6 


/fgo 

2nco 


o 2n(o \ 4 / 8co 


Thus the first approximation to the solution of Equation (13.1 16) is given by 


. ( 3fidl \ 

= d 0 sin I -g— + co If + , 


(13.117) 


where d 0 (the amplitude) and </> 0 are arbitrary constants. Observe that the period 

2n 


8co 


+ co 


of the oscillation (13.117) is a function of the amplitude. 

► Example 1 3.22 


Consider the differential equation 

d 2 x 2 
-r-Y + orx -f )U 
dt 2 

where fi is a small parameter. Here 

dx\ Jdx 


dx\ 2 ( dx\ 3 

+ '^j j 


dt 


= 0 , 


(13.118) 


dt 


dt 


f =/ = = = +[=■), 


dx 


dt 


dx 


dt 


and hence Equation (13.1 18) is of the special form (13.1 13) in which the nonlinear term 
depends upon dx/dt only. Thus the first approximation to the solution of (13.118) is 
given by (13.115), where the amplitude d(t) is determined from the first equation of 
(13.114). Since /(coa cos 9) = co 2 d 2 cos 2 9 + co 3 a 3 cos 3 9 , this equation is 

f 2 * 

(w 2 d 2 cos 2 9 + co 3 d 3 cos 3 9) cos 9 d9 


dd 

Tt 


_V_ 

2nco 


or 


dd 

di 


fiCDd * 


2n 



'2n 

cos 3 9 d9 + tod 

cos 4 9 d9 

LJ 

o 

0 
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Since 


2 n 


cos 3 6 d6 = 0 and 


r 


2 n 3 7C 

cos 4 6 d6 = — , 

4 


this reduces at once to 


da 

dt 


3 fia> 2 a 3 
8 


(13.119) 


Separating variables in (13.119) and integrating, we obtain 

1 3 LUO 2 

2a 1 = 8 1 + C ’ 

where c is an arbitrary constant. If a(0) = a 0 , then c = \/2al. We thus have 


2 a 


1 3 nco 2 1 

+ 2^’ 


and so 


a = 2 


«0 


v 4 + 3fio) 2 alt' 

Thus the first approximation to the solution of Equation (13.1 18) is given by 


x = 2 


sin[cot + </> 0 ]. 


f 4 + 3fi(jo 2 alt 

Assuming ju > 0, the amplitude of this oscillation tends to zero as t -► oo. 


► Example 13.23 


Consider the van der Pol equation 


d 2 x 2 

w + 


,, dx 

D- + X- 0 , 


(13.2) 


where 0 < ju « 1. This equation is of neither of the special forms (13.108) or (13.113) 
considered above but rather of the more general form (13.95), where co 2 = 1 and 


dx 


dt 


dx 


/ x,^ =(x>- 1) = 


dt 


Therefore the first approximation to the solution of (13.2) is given by (13.99), where a(t) 
and (f)(t ) are determined by Equations (13.107) and where co = 1. 

Since f(a sin 6 , a cos 6) = (a 2 sin 2 6 — 1 )a cos 0, the Equations (13.107) become 


da 

di 

# 

dt = 


= -jl r 

2n Jo 


(i a 2 sin 2 6 — l)a cos 2 6 dO, 


2na 


(a 2 sin 2 6 — 1 )a cos 6 sin 6 d6. 
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Since 


f*2n 

JO 


r 


sin 2 6 cos 2 6 d6 = y, 
o 4 


2n 


cos 2 6 d6 = 7E, 


sin 3 9 cos 6 d6 = 0, and 


o 

2tc 


cos 6 sin 6 d6 = 0, 


these equations reduce at once to 


da /ia 

A = T 




(13.120) 


From the second of Equations (13.120) we see at once that </>(t) = </> 0 , a constant. 
Thus by (13.99) the first approximation to the solution is given by 

x = a(t) sin(f + </> 0 ), 


where a(t) is to be found from the first of Equations (13.120). Separating variables in 
this equation we obtain 


da 


a(a 2 - 4) 


= -^dt. 


Integrating and simplifying we find that 

a 2 — 4 


(13.121) 


and hence 


a 1 = 


1 — ce^ 1 


If a{ 0) = a 0 > 0, then c = (a% — 4 )/a% and hence 


a 2 = 


4 aie" 




Thus the first approximation to the solution of the van der Pol equation (13.2) for 
0 < fi « 1 is given by 

aoe * 12 

x = r ---°, = sin(f + (U (13.122) 


where a 0 is the initial value of the amplitude. 

If a 0 = 2, the first approximation (13.122) reduces to the periodic function defined by 


x = 2 sin(t + <j ) 0 ) 


(13.123) 


with constant amplitude 2. 



714 


NONLINEAR DIFFERENTIAL EQUATIONS 


If 0 < a 0 < 2 or if a 0 > 2, we see that 


lim a(t) = lim 

t-> 00 t-> 00 



= 2 . 


Thus for any finite a 0 > 0 except a 0 = 2 the nonperiodic oscillation given by (13.122) 
tends to the periodic oscillation given by (13.123) as t -> oo. 

Now we must remember that the formula (13.122) is only a first approximation to 
the exact solution of Equation (13.2). Hence our above observations give only an ap- 
proximate appraisal of the actual situation. Nevertheless, if 0 < p « 1, they indicate 
the actual situation quite well. Equation (13.2) has only one type of nontrivial periodic 
solutions, and these periodic solutions are given approximately by (13.123). All other 
nontrivial solutions of Equation (13.2) are nonperiodic oscillations which tend to one 
of the aforementioned periodic solutions as t -> oo. These nonperiodic solutions are, of 
course, given approximately by (13.122). 

In the previous section of this chapter we studied the phase plane paths associated 
with the van der Pol equation (13.2). We recall that there is a unique closed path, a limit 
cycle which all nonclosed paths approach as t -> oo. For 0 < p « 1, this limit cycle is 
approximately a circle of radius 2, and the nonclosed paths approach this nearly 
circular path spirally as t -► oo. The limit cycle is represented approximately by the 
periodic oscillations (13.123) above, while the spiral paths are represented approx- 
imately by the nonperiodic oscillations (13.122). 


Exercises 


Use the method of Kryloff and Bogoliuboff to find a first approximation to the 
solution of each of the differential equations in Exercises 1-6. In each case assume that 
0 < p « 1 and that the initial value of the “amplitude” a(t) of the “solution” is given by 
a 0 > 0. 

d 2 x 

~df 2 


U A - <- 

1. -jy + ( 0 2 x + px 5 = 0. 


2 . 

3. 

4. 


dx 

TT + X + fi — 

dt 2 dt 


d 2 x 

It 

d 2 x 

dt 

d 2 x 

dt 


dx 

dt 


= 0 . 


2 +/* 


/ dx\ 3 . 

hr +* 

_\dt ) 


+ co 2 x = 0 . 


2 + co 2 x + fi{x 2 + 1)^ = 0. 


dx 

It 


d 2 x 


5. -^ + co 2 x + /dx 3 +— =0. 


6 . 


dt 2 

d 2 x 

~dl 


dx 


dt 


+ x+n(^x*-l 


dx 

di 


= 0 . 



-CHAPTER FOURTEEN= 

Partial Differential Equations 


In the previous chapters of this text we have concentrated upon ordinary differential 
equations. In this final chapter we shall give a brief introduction to partial differential 
equations. The subject of partial differential equations is a vast one, and an entire book 
of this size could be devoted to an introduction to it. In this short chapter we shall 
merely introduce certain basic concepts of partial differential equations and present a 
basic method of solution which is very useful in many applied problems. 


14.1 SOME BASIC CONCEPTS AND EXAMPLES 


A. Partial Differential Equations and Their Solutions 

Let us first recall that a partial differential equation is a differential equation which 
involves partial derivatives of one or more dependent variables with respect to one or 
more independent variables. We shall say that a solution of a partial differential 
equation is an explicit or implicit relation between the variables which does not contain 
derivatives and which identically satisfies the equation. In certain very simple cases a 
solution can be obtained immediately. 

For example, consider the first-order partial differential equation 

d E = x 1 + y 2 , (14.1) 

OX 

in which u is the dependent variable and x and y are independent variables. We have 
already solved equations of this type in Chapter 2. The solution is 

« = f (x 2 + y 2 ) dx + <t>(y). 
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where J (x 2 + y 2 ) dx indicates a “partial integration” with respect to x, holding y 
constant, and is an arbitrary function of y only. Thus the solution of Equation (14.1) 
is 


w = y + *y 2 + <t>(y ), 


(14.2) 


where </> is an arbitrary function of y. 

As a second example, consider the second-order partial differential equation 


d 2 u 
dy dx 


= x 


y- 


(14.3) 


We first write this in the form 


d ( du\ . 

and integrate partially with respect to y, holding x constant, to obtain 

du 3 y 2 


where <l> is an arbitrary function of x. We now integrate this result partially with respect 
to x, holding y constant, and so obtain the solution of Equation (14.3) in the form 


x*y 


xy 


« = x- + fix) + g(y). 


(14.4) 


where / defined by /(x) = J 4>(x) dx is an arbitrary differentiable function of x and g is 
an arbitrary function of y. 

As a result of these two simple examples, we note that whereas ordinary differential 
equations have solutions which involve arbitrary constants , partial differential 
equations have solutions which involve arbitrary functions. In particular, we note that 
the Solution (14.2) of the first-order partial differential equation (14.1) contains one 
arbitrary function, and the solution (14.4) of the second-order partial differential 
equation (14.3) contains two arbitrary functions. 

Let us now consider the nature of a typical mathematical problem which involves a 
partial differential equation and which originated in the mathematical formulation of 
some physical problem. Such a problem involves not only the partial differential 
equation itself, but also certain supplementary conditions (so-called boundary con- 
ditions or initial conditions or both). The number and nature of these conditions depend 
upon the nature of the physical problem from which the mathematical problem 
originated. The solution of the problem must satisfy both the partial differential 
equation and these supplementary conditions. In other words, the solution of the 
whole problem (differential equation plus conditions) is a particular solution of the 
differential equation of the problem. In Section 2 of this chapter we shall consider a 
basic method which may be employed to obtain this particular solution in certain cases. 
First, however, we consider briefly the class of partial differential equations which most 
frequently occurs in such problems. This is the class of so-called linear partial 
differential equations of the second order. 
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B. Linear Partial Differential Equations of the Second Order 


The general linear partial differential equation of the second order in two independent 
variables x and y is an equation of the form 


A d 2 u d 2 u 3 2 u ^ du du 

A -t — 2 + B — — - — h C -t-j + D — — h E — — h Fu = G, 
ox ox oy oy ox oy 


(14.5) 


where A, B , C, D , F, F, and G are functions of x and y. If G(x, y) = 0 for all (x, y ), 
Equation (14.5) reduces to 


, d 2 u n d 2 u ^ d 2 u ^ du ^ du 
A -z~2 + B — — - — h C — 2 + D — f E — — h Fu = 0. 
ox ox oy oy ox oy 


(14.6) 


Concerning solutions of Equation (14.6), we state the following basic theorem, which is 
analogous to Theorem 11.18 for linear ordinary differential equations. 


THEOREM 14.1 

Hypothesis. Let fi,f 2 ,--“>f n ben solutions of Equation (14.6) in a region R of the xy 
plane. 

Conclusion. The linear combination c l f i + c 2 / 2 + *** + c„/„, where c 1 ,c 2 ,...,c w 
are arbitrary constants , is also a solution of Equation (14.6) in the region R. 

Extending this, it may be shown that the following theorem is also valid. 


THEOREM 14.2 


Hypothesis 

1. Let fi, f 2 >-> fm-- be an infinite set of solutions of Equation (14.6) in a region R 
of the xy plane. 

2. Suppose the infinite series 

Z fn = /l + f 2 + ' ’ ‘ + fn + " * 
n= 1 

converges to f in R , and suppose this series may be differentiated term by term in R to 
obtain the various derivatives (of f) which appear in Equation (14.6). 


Conclusion. 

(14.6) in R. 


00 

The function f (defined by f = £ f„) is also a solution of Equation 

n = 1 


We now consider an important special class of second-order linear partial 
differential equations, the so-called homogeneous linear equations of the second order 
with constant coefficients. An equation of this class is of the form 


d 2 u d 2 u 
a dx 2 + dx dy 



= 0 , 


(14.7) 
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where a , b , and c are constants. The word homogeneous here refers to the fact that all 
terms in (14.7) contain derivatives of the same order (the second). 

We shall seek solutions of (14.7) of the form 

u = f(y + mx )> (14.8) 

where / is an arbitrary function of its argument (having sufficient derivatives)* and m is 
a constant. Differentiating (14.8), we obtain 

d 2 u 2 

= m f (y + mx )» 

= (i«) 

d 2 u 

— 2 = f{y + mx). 

Substituting (14.9) into Equation (14.7), we obtain 

am 2 f"(y + mx) + bmf'(y + mx) + cf"(y + mx) = 0 
or 

f"(y + mx)[am 2 + bm + c] = 0. 

Thus f(y + mx) will be a solution of (14.7) if m satisfies the quadratic equation 

am 2 + bm + c = 0. (14.10) 

We now consider the following four cases of Equation (14.7). 

1. a # 0, and the roots of the quadratic equation (14.10) are distinct. 

2. a # 0, and the roots of (14.10) are equal. 

3. a = 0, b # 0. 

4. a = 0, b = 0, c 7^ 0. 

In Case 1 let the distinct roots of (14.10) be m x and m 2 . Then Equation (14.7) has the 
solutions 

/(y + m^) and g(y + m 2 x), 

where / and g are arbitrary functions of their respective arguments. Applying 
Theorem 14.1, we see that 

f(y + rn x x) + g(y + m 2 x) (14.1 1) 

is a solution of Equation (14.7). 

In Case 2, let the double root of (14.10) be m t . Then Equation (14.7) has the solution 
f(y + m x x), where / is an arbitrary function of its argument. Further, it can be shown 
that in this case Equation (14.7) also has the solution xg(y + m t x), where g is an 


In this chapter, whenever we state that a solution is an "arbitrary" function of its argument, we shall 
always mean an arbitrary function having a sufficient number of derivatives to satisfy the differential 
equation. 
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arbitrary function of its argument. Hence by Theorem 14.1 we see that 

f{y + m t x) + xg(y + m t x) (14.12) 

is a solution of (14.7) 

In Case 3, the quadratic equation (14.10) reduces to bm + c = 0 and hence has only 
one root. Denoting this root by m l9 Equation (14.7) has the solution f(y + m 1 x) 9 where 
/ is an arbitrary function of its argument. Further, it can be verified at once that in this 
case g(x) 9 where g is an arbitrary function of x only, is also a solution of Equation (14.7). 
Thus by Theorem 14.1, we see that 


f(y + m t x) + g(x) 


(14.13) 


is a solution of (14.7). 

Finally, in Case 4, Equation ( 1 4. 1 0) reduces to c = 0, which is impossible. Thus in this 
case there exist no solutions of the form (14.8). However, in this case the differential 
equation (14.7) is simply 


c 


8 2 u 

dy 2 


= 0 


or 


d_(du\ 
dy / 


= 0 . 


Integrating partially with respect to y twice in succession, we obtain u — f(x) + yg{x) 9 
where / and g are arbitrary functions of x only. Thus in this case 


fix) + yg(x) 


(14.14) 


is a solution of (14.7). 

In each of the four cases under consideration we have obtained a solution of (14.7) 
which involves two arbitrary functions. Every equation of the form (14.7) with con- 
stant coefficients is in one and only one of the four categories covered by Cases 1 
through 4. Thus Equation (14.7) always has a solution which involves two arbitrary 
functions. This solution is given by (14.11) in Case 1, by (14.12) in Case 2, by (14.13) 
in Case 3, and by (14.14) in Case 4. 

We now consider two simple examples. 


► Example 14.1 

Find a solution of 

d 2 u d 2 u d 2 u 
dx 2 dx dy + dy 2 

which contains two arbitrary functions. 


(14.15) 


Solution. The quadratic equation ( 1 4. 1 0) corresponding to the differential equation 
(14.15) is 

m 2 — 5m + 6 = 0, 

and this equation has the distinct roots m l = 2 and m 2 = 3. Therefore this is an 
example of Case 1. Hence using (14.1 1) we see that 

u — f(y + 2x) + g(y + 3x) 

is a solution of (14.15) which contains two arbitrary functions. 
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► Example 14.2 


Find a solution of 

d 2 u d 2 u <3 2 u 
dx 2 8x dy 8y 2 

which contains two arbitrary functions. 


(14.16) 


Solution. The quadratic equation ( 1 4. 1 0) corresponding to the differential equation 
(14.16) is 

m 2 — 4m + 4 = 0, 

and this equation has the double root m i = 2. Therefore this is an example of Case 2. 
Hence using (14.12) we see that 

u = f(y + 2x) + xg(y + 2x) 

is a solution of (14.16) which contains two arbitrary functions. 


We close this section by classifying equations of the form (14.6) in the special case in 
which the coefficients A , B, C, D , E , and F are real constants. We shall illustrate this 
classification with certain of the famous equations of mathematical physics. 


DEFINITION 


The second-order linear partial differential equation 

. d 2 u d 2 u ^ d 2 u ^ du du 

A dS +B d^d~y + C d? + s~y + ’ 

where A, B , C, D , E , and F are real constants , is said to be 

1 . hyperbolic if B 2 — 4 AC > 0; 

2. parabolic if B 2 — 4 AC = 0; 

3. elliptic if B 2 - 4AC < 0. 


(14.6) 


► Example 14.3 


The equation 


8 2 u d 2 u 
dx 2 dy 2 


(14.17) 


is hyperbolic, since A = l, B = 0, C = - 1, and B 2 - 4 AC = 4 > 0. Equation (14.17) is a 
special case of the so-called one-dimensional wave equation , which is satisfied by the 
small transverse displacements of the points of a vibrating string. Observing that 
Equation (14.17) is a homogeneous linear equation with constant coefficients, we find 
at once that it has the solution 


w = f(y + x) + g(y - x), 

where / and g are arbitrary functions of their respective arguments. 
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► Example 14.4 


The equation 


d 2 u du 
dx 2 ~dy 


(14.18) 


is parabolic, since A = 1, B = C — 0, and B 2 — A AC = 0. Equation (14.18) is a special 
case of the one-dimensional heat equation (or diffusion equation ), which is satisfied by 
the temperature at a point of a homogeneous rod. We observe that Equation (14.18) is 
not homogeneous. 


► Example 14.5 


The equation 


d 2 u d 2 u _ 
dx 2 + dy 2 


(14.19) 


is elliptic, since A = 1 , B = 0, C = 1 , and B 2 — 4AC = — 4 < 0. Equation ( 1 4. 1 9) is the 
so-called two-dimensional Laplace equation , which is satisfied by the steady-state 
temperature at points of a thin rectangular plate. Observing that Equation (14.19) is a 
homogeneous linear equation with constant coefficients, we find at once that it has the 
solution 


w = f(y + ix) + g(y - ix), 

where / and g are arbitrary functions of their respective arguments. 


Exercises 


For each of the differential equations in Exercises 1-10 find a solution which contains 
two arbitrary functions. In each case determine whether the equation is hyperbolic, 
parabolic, or elliptic. 


d 2 u d 2 u d 2 u 

* d^~ 1 d^i' h6 dp ==0 ' 


3. 


d 2 u d 2 u 
dx 2 + dx dy 



= 0 . 


d 2 u d 2 u _ 
dx 2 + dy 2 

d 2 u ^ d 2 u 
dx 2 dx dy 


9. 


d 2 u d 2 u 
dx 2 dx dy 



d 2 u d 2 u d 2 u 
dx 2 +dxdy dy 2 

a a d 2u a d 2 u d 2 u 
‘ A dx I ~ 4 dxdy + dp = °' 


d 2 u d 2 u 

6 ' 2 d^d~y + 3 W = 


d 2 u ^ d 2 u 

8 * 


_ d 2 u 

+ f V = a 


10. 2 


d 2 u 

dx 2 


d 2 u d 2 u 
2 ~tefy + 5 d? = 0 - 



722 PARTIAL DIFFERENTIAL EQUATIONS 


1 1 . Consider the equation 


d 2 u d 2 u 
dx 2 + dx dy 



+ D 


du „ du 
te + E Ty 


+ Fu = 0. 


where 4, B , C, D , E , and F are functions of x and y. Extending the definition of the 
text, we say that this equation is 

(i) hyperbolic at all points at which B 2 — 4 AC > 0; 

(ii) parabolic at all points at which B 2 - 4 AC = 0; and 

(iii) elliptic at all points at which B 2 — 4 AC < 0. 

(a) Show that the equation 


/ 2 d 2 U 

,X ” ' ) dx‘ +2y dxdy 



is hyperbolic for all (x, y) outside the region bounded by the circle x 2 + 
y 2 = 1, parabolic on the boundary of this region, and elliptic for all (x, y) 
inside this region. 


(b) Determine all points (x, y) at which the equation 


d 2 u d 2 u 
dx 2 ^ X dx dy 


d 2 u 

+ >jp- xr 


du 

dx 


= 0 


(i) is hyperbolic; 

(ii) is parabolic; 

(iii) is elliptic. 


14.2 THE METHOD OF SEPARATION OF VARIABLES 
A. Introduction 

In this section we introduce the so-called method of separation of variables. This is a 
basic method which is very powerful for obtaining solutions of certain problems 
involving partial differential equations. Although the class of problems to which the 
method applies is relatively small, it nevertheless includes many problems of great 
physical interest. 

We have already pointed out that the mathematical statement of such a problem 
involves (1) a partial differential equation and (2) certain supplementary conditions. 
The solution of the problem is a function which satisfies both the equation and the 
various conditions, and the task before us is to find such a function. Let us recall a 
common procedure in solving analogous problems involving ordinary differential 
equations and supplementary conditions. In such a problem we first find the general 
solution of the ordinary differential equation and then apply the supplementary 
conditions, one at a time, to obtain the particular solution which satisfies them. Should 
we attempt to follow a similar procedure in problems in which the differential equation 
is partial instead of ordinary? Generally speaking, the answer is “no.” In the first place, 
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it is often impossible to obtain a “general” solution of the partial differential equation; 
and in the second place, even if such a solution can be found, it is usually impossible to 
apply the supplementary conditions to it. How then should we proceed? Instead of first 
finding the most general solution and then specializing it to obtain the particular 
solution desired, we shall first find particular solutions which satisfy some of the 
supplementary conditions and then combine these particular solutions in some manner 
to obtain a solution which satisfies all of the supplementary conditions. That is, instead 
of “breaking down” to the desired particular solution, the method of separation of 
variables in partial differential equations is one of “building up” to it. 

In attempting to apply the method of separation of variables to a partial differential 
equations problem, one makes a certain basic assumption at the outset. If the partial 
differential equation involves n independent variables x u x„, one first assumes 

that the equation possesses product solutions of the form X x X 2 * * • X n , where is a 
function of x { only (i = 1, 2, . . . , n). If the method actually does apply to the problem, 
then this basic assumption will produce n ordinary differential equations, one in each of 
the unknown functions X { (i = 1,2,..., n). Certain of the supplementary conditions of 
the problem lead to supplementary conditions which must be satisfied by certain of the 
functions X { . Thus at this stage of the method, one must solve n problems in ordinary 
differential equations, some of which involve initial or boundary conditions. The 
solutions of these n problems produce particular solutions of the form X l X 2 'X„ 
which satisfy some of the supplementary conditions of the original problem. One must 
now attempt to combine these particular solutions in some manner to produce a 
solution which satisfies all of the original supplementary conditions. One does this by 
first applying Theorem 14.2 to obtain a series solution and then applying the remaining 
supplementary conditions to this series solution. 

We point out that the procedure outlined in the preceding paragraph is a strictly 
formal one. We shall illustrate it in considerable detail in parts B , C, and D of this 
section, and our treatment there will again be of a strictly formal nature. We have not 
made, and shall not make, any attempt to justify the procedure; but at least we should 
be aware of some of the questions with which a rigorous treatment would be 
concerned. In the first place, there is the question of existence. Does a solution of the 
given partial differential equations problem actually exist? In analogous problems 
involving ordinary differential equations we have seen that existence theorems are 
available which often enable us to answer questions of this type. Existence theorems for 
problems in partial differential equations are also known (see Greenspan, Introduction to 
Partial Differential Equations , for example), and these theorems can often be applied to 
answer the existence equation. But even if such a theorem answers this question in the 
affirmative, how do we know that a solution can be built up by the method which we 
have outlined? Even if the method seems to work formally, the various steps require 
justification. In particular, questions of convergence enter the problem, and a rigorous 
justification of the method involves a careful study of these convergence questions. 
Finally, assuming that each step can be justified and that the final result is indeed a 
solution, the question of uniqueness enters the picture. Is the solution obtained the only 
solution of the problem? Uniqueness theorems are known which can often be applied 
to answer this question. 

In summary, our treatment of the method of separation of variables is a formal one. 
In a rigorous treatment one must show (1) that the formal “solution” obtained actually 
does satisfy both the partial differential equation and all of the supplementary 
conditions, and (2) that the solution thus justified is the only solution of the problem. 
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B. Example: The Vibrating String Problem 

We now illustrate the method of separation of variables by applying it to obtain a 
formal solution of the so-called vibrating string problem . 

The Physical Problem. Consider a tightly stretched elastic string the ends of 
which are fixed on the x axis at x = 0 and x = L. Suppose that for each x in the interval 
0 < x < L the string is displaced into the xy plane and that for each such x the dis- 
placement from the x axis is given by /(x), where / is a known function of x 
(see Figure 14.1). 



Suppose that at t = 0 the string is released from the initial position defined by /(x), 
with an initial velocity given at each point of the interval 0 < x < L by g(x), where g is a 
known function of x. Obviously the string will vibrate, and its displacement in the y 
direction at any point x at any time t will be a function of both x and t. We seek to find 
this displacement as a function of x and t ; we denote it by y or y(x, t ). 

We now make certain assumptions concerning the string, its vibrations, and its 
surroundings. To begin with, we assume that the string is perfectly flexible, is of 
constant linear density p, and is of constant tension T at all times. Concerning the 
vibrations, we assume that the motion is confined to the xy plane and that each point 
on the string moves on a straight line perpendicular to the x axis as the string vibrates. 
Further, we assume that the displacement y at each point of the string is small 
compared to the length L and that the angle between the string and the x axis at each 
point is also sufficiently small. Finally, we assume that no external forces (such as 
damping forces, for example) act upon the string. 

Although these assumptions are not actually valid in any physical problem, 
nevertheless they are approximately satisfied in many cases. They are made in order to 
make the resulting mathematical problem more tractable. With these assumptions, 
then, the problem is to find the displacement y as a function of x and t. 

The Mathematical Problem. Under the assumptions stated it can be shown that 
the displacement y satisfies the partial differential equation , 


2 d 2 y _ d 2 y 
dx 2 dt 2 ’ 


(14.20) 
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where a 2 = T/p . This is the one-dimensional wave equation, a special case of which we 
have already studied in Example 14.3. Since our primary concern here is to illustrate 
the method of separation of variables, we omit the derivation of the equation. 

Since the ends of the string are fixed at x = 0 and x = L for all time t, the 
displacement y must satisfy the boundary conditions 

y(0, t) = 0, 0 < t < oo; 

y(L, t) = 0, 0 < t < oo. (14.21) 

At t = 0 the string is released from the initial position defined by /(x), 0 < x < L, 
with initial velocity given by g(x), 0 < x < L. Thus the displacement y must also satisfy 
the initial conditions 


y(x, 0) = /(x), 0 < x < L; 


dy(x, 0) 

dt 


= g(x), 


0 < x < L. 


(14.22) 


This, then, is our problem. We must find a function y of x and t which satisfies the 
partial differential equation (14.20), the boundary conditions (14.21), and the initial 
conditions (14.22). 


Solution. We apply the method of separation of variables. We first make the basic 
assumption that the differential equation (14.20) has product solutions of the form XT, 
where A" is a function of x only and T is a function of t only. To emphasize this, we write 


y(x, t ) = X(x)T(t). 


(14.23) 


We now differentiate (14.23) and substitute into the differential equation (14.20). 
Differentiating, we find 


d 2 y 

dx 2 



and 


d 2 y y d 2 r 

dt 2 dt 2 ’ 


substituting, we obtain 


From this we obtain at once 


<x 2 T 


d 2 X _ d 2 T 
dx 2 dt 2 ' 

d 2 X d 2 T 
dx 2 dt 2 

~X~ = ^F' 


(14.24) 


Since X is a function of x only, the left member of (14.24) is also a function of x only 
and hence is independent of t. Further, since T is a function of t only, the right member 
of (14.24) is also a function of t only and hence is independent of x. Since one of the two 
equal expressions in (14.24) is independent of t and the other one is independent of x, 
both of them must be equal to a constant k. That is, we have 


d 2 X 

2 dx 2 
a z 


d 2 T 



X 


= k and 
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From this we obtain the two ordinary differential equations 

d 2 X k 


and 


dx 2 


d 2 T 

Hi 2 


X = 0 


-kT = 0. 


(14.25) 


(14.26) 


Let us now consider the boundary conditions (14.21). Since y(x, t) = X(x)T(t),WQSGG 
that y(0, t) = X(0)T(t) and y(L, t) = X(L)T(t). Thus the boundary conditions (14.21) 
take the forms 


X(0)T(t) = 0, 0< t< oo ; 

X(L)T(t) = 0, 0 < t < oo. 

Since T(t) = 0, 0 < t < oo, would reduce the assumed solution (14.23) to the trivial 
solution of (14.20), we must have 

X(0) = 0 and X(L) = 0. (14.27) 

Thus the function X in the assumed solution (14.23) must satisfy both the ordinary 
differential equation (14.25) and the boundary conditions (14.27). That is, the function 
X must be a nontrivial solution of the Sturm-Liouville problem 

0 -?*-°' < r25 > 


X(0) = 0, X(L) = 0. (14.27) 

We have already solved a special case of this problem in Example 12.3 of Chapter 12. 
Our procedure here will parallel the treatment in that example. We must first find the 
general solution of the differential equation (14.25). The form of this general solution 
depends upon whether k = 0, k > 0, or k < 0. 

If k = 0, the general solution of (14.25) is of the form 

X = c t + c 2 x. (14.28) 

We apply the boundary conditions (14.27) to the solution (14.28). The condition 
2f(0) = 0 requires that c L = 0. The condition X{L) = 0 becomes c t + c 2 L = 0. Since 
c t = 0, this requires that c 2 = 0 also. Thus the solution (14.28) reduces to the trivial 
solution. 

If k > 0, the general solution of (14.25) is of the form 

x = + C 2 e-^I°. (14.29) 

Applying the boundary conditions (14.27) to the solution (14.29), we obtain the system 
of equations 

Ci + c 2 = 0, 

c l e' /lLI ’‘ + c 2 e~' n ‘ L ' ,t = 0. 


To obtain nontrivial solutions of this system, we must have 

1 1 

e SkL/z e - JkL/a ~ °- 


(14.30) 
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But this implies that e^ Lla = e~^ L,a and hence that k = 0, contrary to our assumption 
in this case. Thus the system (14.30) has no nontrivial solutions, and so the solution 
(14.29) also reduces to the trivial solution. 

Finally, if k < 0, the general solution of (14.25) is of the form 

X = c t sin — + c 2 cos . (14.31) 

a a 

Applying the boundary conditions (14.27) to the solution (14.31), we obtain 

c 2 = 0 
and 

. J^kL yT—kh ^ 

c t sin — 1- c 2 cos — = 0. 

a a 

Since c 2 = 0, the latter condition reduces to 

c v sin — = 0. 

a 

Thus to obtain nontrivial solutions of the form (14.31), we must have 

= nn (n = 1, 2, 3,...), 
a 

and so 

n 2 n 2 ot 2 

k= jj~ (n = 1,2,3,...). (14.32) 

We thus find that the constant k must be a negative number of the form (14.32). We 
recognize these values of k as the characteristic values of the Sturm-Liouville problem 
under consideration. The corresponding nontrivial solutions (the characteristic func- 
tions) of the problem are then 

YITIX 

X n = c n sin— (n = 1,2,3,...), (14.33) 

where the c n (n = 1 , 2, 3, . . . ), are arbitrary constants. We thus find that the function X in 
the assumed solution (14.23) must be of the form (14.33). That is, corresponding to each 
positive integral value of n , we obtain functions X n of the form (14.33) which will serve 
as the function X in the product solution (14.23). 

Let us now return to the differential equation (14.26) which the function T in (14.23) 
must satisfy. Since k must be of the form (14.32), the differential equation (14.26) 
becomes 


where n = 1, 2, 3,.... 
of the form 

T n = 


d 2 T n 2 n 2 (x 2 

For each such value of n, this differential equation has solutions 
. nnoct nnat 

c „, i sin — — + c w>2 cos — (n = 1, 2, 3,...), (14.34) 
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where the c„ tl and c„ t2 (n = 1, 2, 3, ... ), are arbitrary constants. Thus the function T in 
the assumed solution (14.23) must be of the form (14.34). That is, corresponding to each 
positive integral value of n, we obtain functions T n of the form (14.34) which will serve as 
the function T in the product solution (14.23). 

Therefore, corresponding to each positive integral value of n (n = 1, 2, 3,...), we 
obtain solutions 


X„T = 


c„ sin- 


nnx 


. nnoct nnoct 

C nA Sin — Z h C„ t 2 COS — — 


which have the product form (14.23). 

We set a n = c n c n l and b n = c n c n%2 ( n = 1, 2, 3,. . .), and write these solutions as 


/ x T • nnx 
y„(x, t ) = sin — 


. nnoct nnoct 

a n sin — j— + b n cos -j— 


(n= 1,2,3,...). (14.35) 


We point out that each of these solutions (14.35) satisfies both the partial differential 
equation (14.20) and the two boundary conditions (14.21) for all values of the constants 
a n and b n . 

We must now try to satisfy the two initial conditions (14.22). In general no single one 
of the solutions (14.35) will satisfy these conditions. For example, if we apply the first 
initial condition (14.22) to a solution of the form (14.35) we must have 


, • nnx v 

b n sin — = /(x), 


0 < X < L, 


where n is some positive integer; and this is clearly impossible unless / happens to be a 
sine function of the form A sin(nnx/L) for some positive integer n. 

What can we do now? By Theorem 14.1 every finite linear combination of solutions 
of (14.20) is also a solution of (14.20); and by Theorem 14.2, assuming appropriate 
convergence, an infinite series of solutions of (14.20) is also a solution of (14.20). This 
suggests that we should form either a finite linear combination or an infinite series of 
the solutions (14.35) and attempt to apply the initial conditions (14.22) to the “more 
general” solutions thus obtained. In general no finite linear combination will satisfy 
these conditions, and we must resort to an infinite series. 

We therefore form an infinite series 


Z y«( x > 0 = E 


. nnxY • n7t c 

sin — — a n sin — — 

L L 


nnat , nnat 
+ b n cos — — 


of the solutions (14.35). Assuming appropriate convergence, Theorem 14.2 applies and 
assures us that the sum of this series is also a solution of the differential equation 
(14.20). Denoting this sum by y(x, t ), we write 


00 nnx 

y(x,t)= Z sin— a„ si 
n = 1 L ^ J 


nnoct , nnoct 
sin — ^ h b n cos — — 


(14.36) 


We note that y(0, t) = 0 and y(L, t) = 0. Thus, assuming appropriate convergence, the 
function y given by (14.36) satisfies both the differential equation (14.20) and the two 
boundary conditions (14.21). 
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Let us now apply the initial conditions (14.22) to the series solution (14.36). The first 
condition y(x, 0) = /(x), 0 < x < L, reduces (14.36) to 


^ 1 . ^TCX r/ V 

L b n sin — = /(x), 0 < x < L. 

n=l L 


(14.37) 


Thus to satisfy the first initial condition (14.22), we must determine the coefficients b n so 
that (14.37) is satisfied. We recognize this as a problem in Fourier sine series (see 
Section 12.4C). Using (12.54) we find that the coefficients b n are given by 


b = 


MIX 


/(x)sin— — dx ( n = 1, 2, 3,...). 


(14.38) 


Thus in order for the series solution ( 1 4.36) to satisfy the initial condition y(x, 0) = / (x), 
0 < x < L, the coefficients b n in the series must be given by (14.38). 

The only condition which remains to be satisfied is the second initial condition 

(14.22), which is 

dy(x 9 0) 


dt 


= g(x), 0 < x < L. 


From (14.36), we find that 
dy(x, t) 


dt 


£ f ratal I” . nnx~\ 

= L ~T~ Sin ~T“ a n 

n= 1 L L JL L J _ 


nnoct . nnott 
cos — b n sin 


KCtt 

T~_ * 


The second initial condition reduces this to 

a n nna . nnx 


Z u„nnw. . /i a , v _ 

— 7 — sin — = g(x), 0 < x < L. 

M = 1 L, L 

Letting A n = a„nnoc/L (n = 1, 2, 3, . . .), this takes the form 
00 nnx 

Z A » sin ~r~ = 9 (x), o < x < L. 

n = 1 ^ 


(14.39) 


Thus to satisfy the second initial condition (14.22), we must determine the coefficients 
A n so that (14.39) is satisfied. This is another problem in Fourier sine series. Using 
(12.54) again, we find that the coefficients A n are given by 


A = 


' / v • n7tX J 

^f(x)sin ax 


(n = 1, 2, 3,...). 


Since A n = a„nnot/L ( n = 1, 2, 3, . . .), we find that 
L 


a n = 


nnoc 


2 C L nnx 

A n = g(x)sin——dx (n = 1,2,3,...). 

nnoc J 0 L 


(14.40) 


Thus in order for the series solution (14.36) to satisfy the second initial condition 

(14.22) , the coefficients a n in the series must be given by (14.40). 

Therefore, the formal solution of the problem consisting of the partial differential 
equation (14.20), the two boundary conditions (14.21), and the two initial conditions 

(14.22) is 


y(x. 


® f . nnx~]V . nnoct nnoct 

0= L sin— a„ sin — 1- b n cos — - — 

n=! L L JL L ‘ 


(14.36) 
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where 


and 


-L f g(x)sin'^-dx (n = 1,2,3,...). 

(14.40) 

nnct Jo L 



K ~z 


f(x ) sin dx 


(n = 1,2, 3,...). 


(14.38) 


Summary. We briefly summarize the principal steps in the solution of this problem. 
The initial step was to assume the product solution XT given by (14.23). This led to the 
ordinary differential equation (14.25) for the function X and the ordinary differential 
equation (14.26) for the function T. We then considered the boundary conditions 

(14.21) and found that they reduced to the boundary conditions (14.27) on the function 
X. Thus the function X had to be a nontrivial solution of the Sturm-Liouville problem 
consisting of (14.25) and (14.27). The next step was to solve this Sturm-Liouville 
problem. We did this and obtained for solutions the functions X n given by (14.33). We 
then returned to the differential equation (14.26) for the function T and obtained the 
solutions T n given by (14.34). Thus, for each positive integral value of n , we found the 
product solutions X n T n denoted by y n and given by (14.35). Each of these solutions y n 
satisfied both the partial differential equation (14.20) and the boundary conditions 

(14.21) , but no one of them satisfied the initial conditions (14.22). In order to satisfy 
these initial conditions, we formed an infinite series of the solutions y n . We thus 
obtained the formal solution y given by (14.36), in which the coefficients a n and b n were 
arbitrary. We applied the initial conditions to this series solution and thereby 
determined the coefficients a n and b n . We thus obtained the formal solution y given by 
(14.36), in which the coefficients a n and b n are given by (14.40) and (14.38), respectively. 
We emphasize that this solution is a formal one, for in the process of obtaining it we 
made assumptions of convergence which we did not justify. 


A Special Case. As a particular case of the vibrating string problem, we consider 
the problem of the so-called plucked string. Let us suppose that the string is such that 
the constant a 2 = 2500 and that the ends of the string are fixed on the x axis at x = 0 
and x = 1. Suppose the midpoint of the string is displaced into the xy plane a distance 
0.01 in the direction of the positive y axis (see Figure 14.2). 
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Then the displacement from the x axis on the interval 0 < x < 1 is given by /(x), 
where 


m = 


X 1 

50' ^ — x ~2’ 

X 1 1 

50 + 50’ 2 £X£l 


(14.41) 


Suppose that at t = 0 the string is released from rest from the initial position defined by 
/ (x), 0 < x < 1 . Let us find the formal expression ( 1 4.36) for the displacement y(x, t) in 
the y direction, in this special case. 

The coefficients a n and b n in the expression (14.36) are given by (14.40) and (14.38), 
respectively. In the special case under consideration we have a = 50, L = 1, and /(x) 
given by (14.41). Further, since the string is released from rest, the initial velocity is 
given by g(x) = 0, 0 < x < 1. Therefore from (14.40) we find that 


a n = 0 (n = 1,2,3,...). 


Using (14.38), we find that 


b n = 2 /(x) sin nnx dx 

Jo 


r 1/2 

= 2 J. 


50 

2 . nn 


sin nnx dx + 2 


L(- 


55 + 55 |sin nKX dx 


-25„V S,n T ( n = 1- 2, 3,...). 

Hence the even coefficients b 2 „ = 0 (n = 1, 2, 3,. . and the odd coefficients are given 
by 

(— 1)" -1 2 

b2n - 1 = 257t 2 (2n- l) 2 ( ” = 1 ’ 2 ’ 3 ’---)' 

Therefore in the special case under consideration the expression (14.36) for the 
displacement is 

2 oo (—i)"- 1 

( ) = 2^1 Z ^ n _ i )2 sin [( 2n “ 0"^] cos[50(2n - l)wr]. 


C. An Example Involving the Laplace Equation 

We now consider a second example of the application of the method of separation of 
variables to a partial differential equations problem. The problem which we shall 
consider originated from a problem of physics, but we shall not enter into a discussion 
of this related physical problem. Our sole purpose in presenting this second example is 
to help the reader to gain greater familiarity with the various details of the method 
under consideration. 
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Problem. Apply the method of separation of variables to obtain a formal solution 
u(x , y) of the problem which consists of the two-dimensional Laplace equation 


d 2 u d 2 u 
dx 2 + dy 2 


(14.42) 


and the four boundary conditions 


u(0, y) = 0, 

0 < y < b; 

(14.43) 

u(a, y) = 0, 

0 <y<b; 

(14.44) 

u(x, 0) = 0, 

0 < x < a; 

(14.45) 

u(x, b) = f{x). 

0 < x < a. 

(14.46) 


We point out that the numbers a and b are definite positive constants and the function / 
is a specified function of x, 0 < x < a. 


Formal Solution. We first make the basic assumption that the differential equation 
(14.42) has product solutions of the form X Y, where X is a function of x only and Y is a 
function of y only. That is, we assume solutions 


u(x, y) = X(x)y(y). 


(14.47) 


We now differentiate (14.47) and substitute into the differential equation (14.42). 
Differentiating, we find 


d 2 u 

dx 2 


substituting, we obtain 


v d 2 X d 2 u v d 2 Y 

r ~d and ip =x d?'’ 




From this we obtain at once 


d 2 X d 2 Y 
dx 2 dy 2 

~Y~ = T 


(14.48) 


The left member of (14.48) is a function of x only and so is independent of y. The right 
member of (14.48) is a function of y only and so is independent of x. Therefore the two 
equal expressions in (14.48) must both be equal to a constant k. Setting each member of 
(14.48) equal to this constant k, we obtain the two ordinary differential equations 


d 2 X 

dx 2 


-kX = 0 


and 


d 2 Y 

dy 2 


+ kY = 0. 


Let us now consider the four boundary conditions (14.43) through (14.46). The first 
three of these are homogeneous, but the fourth one is not. Let us attempt to satisfy the 
three homogeneous conditions first. Since w(x, y) = A r (x)7(y), we see that the three 
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homogeneous conditions (14.43), (14.44), and (14.45) reduce to 
X(0)Y(y) = 0, 0 <y<b; 

X(a)Y(y) = 0, 0 < y < b; 

and 


X(x)Y(0) = 0, 0 <x < a\ respectively. 

Since either X(x) = 0, 0 < x < a, or Y(y) = 0, 0 < y < b, would reduce the assumed 
solution (14.47) to the trivial solution of (14.42), we must have 

X(0) = 0, X(a) = 0, and Y( 0) = 0. 

Thus the function X in the assumed solution (14.47) must be a nontrivial solution of 
the Sturm-Liouville problem 

^ -kX = 0, (14.49) 

ax 


X(0) = 0, X(a) = 0. (14.50) 

Further, the function Y in (14.47) must be a nontrivial solution of the problem 

d 2 Y 

—7 + fcY = 0, (14.51) 

ay 

HO) = 0. (14.52) 

The Sturm-Liouville problem (14.49) and (14.50) is essentially the same as the 
problem (14.25) and (14.27) which we encountered and solved in connection with the 
vibrating string problem in Part B of this section. Indeed, the present problem (14.49) 
and (14.50) is merely the special case of the problem (14.25) and (14.27) in which a 2 = 1 
and L = a. Thus if we set a 2 = 1 and L = a in the results (14.32) and (14.33) of the 
problem (14.25) and (14.27), we shall obtain the desired results for the present problem 
(14.49) and (14.50). Doing this, we first find from (14.32) that the constant k in (14.49) 
must be given by 

n 2 n 2 

k= ( n = U2,3,...). (14.53) 


Then from (14.33) we find that the corresponding nontrivial solutions of the problem 
(14.49) and (14.50) are 


X n = c n sin 


nnx 


(n = 1, 2, 3,...), 


(14.54) 


where the c n (n = 1, 2, 3,...) are arbitrary constants. That is, corresponding to each 
positive integral value of n , we obtain functions X n of the form (14.54) which will serve 
as the function X in the product solution (14.47). 

We now return to the problem (14.51) and (14.52) involving the function Y. Since k 
must be of the form (14.53), the differential equation (14.51) becomes 


d 2 Y 

dy 2 


n 2 n 2 


Y = 0, 


where n = 1, 2, 3,.... For each such value ofn, this differential equation has the general 
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solution 


Y n = W"’" + c nt2 e-™* a (n = 1, 2, 3,...), 

where c nl and c„ t2 (n = 1,2,3,...), are arbitrary constants. In order to satisfy the 
condition (14.52), we must have 

C n , 1 ^n, 2 ^ 1 5 2 , 3, . . . ). 

Thus nontrivial solutions of the problem (14.51) and (14.52) are 

Y n = c nA (e n ^ a -e~ nny/a ) (n= 1,2,3,...), 

where the c nl (n = 1, 2, 3,...), are arbitrary constants. Using the identity e e — 
e~ d = 2 sinh 8 , we may put these solutions in the form 


7„ = d nl sinh 


nny 


(n = 1,2, 3,...), 


(14.55) 


where the c' n l ( n = 1, 2, 3,...), are arbitrary constants. Thus, corresponding to each 
positive integral value of n, we obtain functions T„of the form (14.55) which will serve as 
the function Y in the product solution (14.47). 

Hence, corresponding to each positive integral value of n(n = 1, 2, 3, . . .), we obtain 
solutions 


y v = r 

n 1 n c n 


. nnx , . , nny 

sin c n sinh 

a ’ a 


which have the product form (14.47). We set C n = c n c' nA (n = 1, 2, 3, . 
solutions as 


and write these 


nnx nn v 

y) = C n sin sinh (n = 1, 2, 3,. . .). (14.56) 

a a 


Each one of these solutions (14.56) satisfies both the partial differential equation (14.42) 
and the three homogeneous boundary conditions (14.43), (14.44), and (14.45) for all 
values of the constant C n . 

We must now apply the nonhomogeneous boundary condition (14.46). In order to 
do this, we form an infinite series 


v ^ ^ . nnx nny 

I K(x, y)= L C n sin — sinh 

n=l n = 1 a a 


of the solutions (14.56). Assuming appropriate convergence, Theorem 14.2 applies and 
assures us that the sum of this series is also a solution of the differential equation 
(14.42). Denoting this sum by u(x, y), we write 


, x £ ^ • nnx • i nn y 

u(x, v) = Y C n sin sinh . (14.57) 

n = 1 & Cl 

We observe that u{ 0, y) = 0, u(a , y) = 0, and u(x, 0) = 0. Thus, assuming appropriate 
convergence, the function u given by (14.57) satisfies both the differential equation 
(14.42) and the three homogeneous boundary conditions (14.43), (14.44), and (14.45). 

We now apply the nonhomogeneous boundary condition (14.46) to the series 
solution (14.57). Doing so, we obtain at once 

£ ^ . nnx . , nnb r/ x 

> C n sin sinh = j (x), 0 < x < a. 

n = 1 Cl Cl 
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Letting A n = C n sinh(«7cfo/a) (n = 1, 2, 3,...), this takes the form 

00 nnx 

Y A n sin = /(x), 0 < x < a. (14.58) 

n = 1 0 

Thus in order to satisfy the condition (14.46), we must determine the coefficients A n so 
that (14.58) is satisfied. This is a problem in Fourier sine series. Using (12.56), we find 
that the coefficients A n are given by 

A n = - /(x)sin^^dx (n = 1,2,3,...). 

ci * o ^ 

Since ,4„ = C n sinh(nnb/a) ( n = 1, 2, 3, . . .), we find that 
A 2 C a mix 

C„ = ” j~ = r f(x) Sin dx (n = 1, 2, 3, . . .). (14.59) 

. , nub . , In a 

smh a sinh * 

a a 

Thus in order for the series solution (14.57) to satisfy the nonhomogeneous boundary 
condition (14.46), the coefficients C n in the series must be given by (14.59). 

Therefore the formal solution of the problem consisting of the partial differential 
equation (14.42) and the four boundary conditions (14.43) through (14.46) is 


where 


™ . nnx . . nny 

u ( x > y) = X C„ sin sinh , 

n = 1 a a 


2 nnx , 

/(x)sin dx (n = 1,2,3,. 

. , /17CO o 

a sinh c/u 


(14.57) 


(14.59) 


D. An Example Involving Bessel Functions 

In this example we shall apply the method of separation of variables to a problem in 
which the partial differential equation has a variable coefficient. As a result of this 
variable coefficient we shall encounter certain difficulties which were not present in the 
two previous examples. Further, we shall need to know a few results which we have not 
yet proved. Whenever such a result is needed, we shall state it without proof. 


► Problem Apply the method of separation of variables to obtain a formal solu- 
tion u(x, t) of the problem which consists of the partial differential equation 


d 2 u 1 du du 
dx 2 x dx dt 


(14.60) 


and the three conditions 


1. u(L,t) = 0, t > 0; (14.61) 

2. u(x, 0) = f(x), 0 < x < L, (14.62) 

where / is a prescribed function of x, 0 < x < L; and 

3. lim u(x, t) = 0 (14.63) 

t -> oo 


for each x, 0 < x < L. 
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Formal Solution. We begin by making the basic assumption that the differential 
equation (14.60) has product solutions of the form 

u(x,t) = X(x)T(t), (14.64) 

where X is a function of x only and T is a function of t only. Upon differentiating (14.64) 
and substituting into the differential equation (14.60), we obtain 

d 2 X 1 dX dT 

T -7~t + - T— = X — . 
dx x dx dt 


From this we obtain at once 


1 (d 2 X 1 dX\ _ 1 dT 
X\dx T + xl^)~T~dt' 


(14.65) 


The left member of (14.65) is a function of x only and so is independent of t . The right 
member of (14.65) is a function of t only and so is independent of x. Therefore the two 
equal expressions in (14.65) must both be equal to a constant k. Setting each member of 
(14.65) equal to this constant k , we obtain the two ordinary differential equations 


d 2 X 

dx 2 


1 dX , v „ 
+ -kX = 0 
x dx 


(14.66) 


and 

I rri 

— - kT = 0. (14.67) 

dt 

The effect of the variable coefficient (1/x) in the partial differential equation (14.60) 
appears here, for the ordinary differential equation (14.66) also has this same variable 
coefficient. 

We shall need the general solutions of both of the ordinary differential equations 
(14.66) and (14.67). Equation (14.67) is the more promising of the two; let us work with 
it first. We find at once that the general solution of (14.67) is of the form 

T=Ce kt , (14.68) 

where C is an arbitrary constant. 

Let us now examine the three conditions (14.61), (14.62), and (14.63) to see if any of 
them will lead to further information about the solution (14.68). The first two of these 
conditions lead to conditions on the function X. Let us therefore examine the third 
condition (14.63). Since u(x, t) = X(x)T(t), this condition reduces to 

X(x)[lim T(t)] = 0 

t -> oo 

for each x,0 < x < L. Hence we require that 

lim T(t) = 0. 

t~> 00 

From this we see that the constant k in (14.68) must be a negative number. Therefore we 
set k = - 1 2 , where X is real and positive. The general solution (14.68) of the differential 
equation (14.67) now takes the form 

T = Ce~ x2 \ 


where C is an arbitrary constant. 


(14.69) 
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Let us now return to the differential equation (14.66) for X. Since k = —A 2 it now 
takes the form 


or equivalently, 


d 2 X 1 dX 
dx 2 + x dx + 



dX 

+ X — h 

dx 


X 2 X = 0 


X 2 x 2 X = 0. 


(14.70) 


The transformation 9 = kx reduces (14.70) to the equation 


n0 d 2 X n dX n 

e 7iF + e M +ex - 0 ' 


We readily recognize this equation as the Bessel equation of order zero. Its general 
solution may be written 

X = Cl J o (0) + c 2 Y o (0), 

where J 0 and Y 0 are the Bessel functions of order zero of the first and second kind, 
respectively, and c i and c 2 are arbitrary constants (see Section 6.3). Thus the general 
solution of (14.70) may be written 

X = cMlx) + c 2 Y 0 (Xx), (14.71) 

where c i and c 2 are arbitrary constants. 

Let us now return to the condition (14.63). This condition requires that 

lim u(x , t) = 0 

t-> 00 

for each x, 0 < x < L. For x = 0 this becomes 

lim u( 0, f) = 0 

t~> oo 

or 


X(0)[lim T(f)] = 0. 

t -* 00 

In order to satisfy this condition we must require that X(0) be finite. We recall that 
J 0 ( 0) = 1. However, it can be shown that 

lim y 0 (Ax) = — oo. 

X-> 00 

Thus in order for X(0) to be finite we must set c 2 = 0. Thus the solution (14.71) reduces 
to 


X = Cl J 0 {Xx). (14.72) 

Let us now consider the condition (14.61). We have already noticed that this 
condition leads to a condition on X. Indeed, it reduces to 

X(L)T(t) = 0, t >0; 

thus we must have X(L) = 0. Applying this to the solution (14.72), we see that X must 
satisfy the equation 


J 0 (XL) = 0. 


(14.73) 
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In Section 6.3B we pointed out that the function J 0 has a damped oscillatory behavior 
as x -► + oo. Thus the equation J 0 (x) = 0 has an infinite sequence of positive roots 
x n (n = 1,2,3,...). Let us arrange these positive roots such that x n < x n+l ( n = 1, 
2, 3, . . .). Then there exists a monotonic increasing sequence of positive numbers 

K = j; («= 1,2,3,...), 

each of which satisfies Equation (14.73). Thus corresponding to each positive integer n , 
the differential equation (14.70) has solutions which satisfy the condition (14.61). These 
solutions are of the form 

X n = c u MK*) (n= 1,2,3,...), (14.74) 

where the c ln (n = 1, 2, 3,. . .) are arbitrary constants and the X n ( n = 1, 2, 3,. ..)are the 
positive roots of Equation (14.73). That is, corresponding to each positive integer n , we 
obtain functions X n of the form (14.74) which will serve as the function X in the product 
solution (14.64). 

Let us now return to the solution (14.69) of the differential equation (14.67). We see 
that, corresponding to each positive integer n , the differential equation (14.67) has 
solutions of the form 


T n = c 2tn e- X » 2t (n = 1,2,3,...), (14.75) 

where the c 2n {n = 1, 2, 3, . . .) are arbitrary constants and the X n (n = 1, 2, 3, . . .) are the 
positive roots of Equation (14.73). That is, corresponding to each positive integer n , we 
obtain functions T n of the form (14.75) which will serve as the function T in the product 
solution (14.64). 

Hence, corresponding to each positive integral value of n (n = 1, 2, 3, . . .), we obtain 
product solutions of the form 

u n (x, t) = A n J 0 (X„x)e~ x " 2 ' ( n = 1, 2, 3,. . .), (14.76) 

where the A n = c Un c 2f „ (n = 1, 2, 3,...) are arbitrary constants.. Each one of these 
solutions (14.76) satisfies the partial differential equation (14.60) and the conditions 
(14.61) and (14.63) for all values of the constant A n . 

We must now apply the initial condition (14.62). In order to do this, we form an 
infinite series of the solutions (14.76). Assuming appropriate convergence, the sum of 
this series is also a solution of the partial differential equation (14.60). We denote this 
sum by u(x, t) and thus write 

«(*,*)= t AJoMe-*" 2 '. (14.77) 

n = 1 

Applying the initial condition (14.62) to the series solution (14.77), we obtain 

00 

£ A„J 0 (X„X) = f{x), 0 <x<L. (14.78) 

n= 1 

Thus in order to satisfy the initial condition (14.62), the coefficients A n must be 
determined so that (14.78) is satisfied. In other words, we must expand the function / in 
a series of Bessel functions of the first kind of order zero, valid on the interval 0 < 
x < L. 

Here we have encountered a new difficulty, and this difficulty leads to matters which 
are outside the scope of this book. Nevertheless, we shall indicate briefly what can be 
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done. This is one place where we need to know certain of the results which we referred 
to at the beginning of this section. Let us state them and get on with the problem! 

In the first place, it can be shown that if the numbers X n ( n = 1, 2, 3,...) are the 
positive roots of the equation J 0 (1L) = 0, then the set of functions defined by { J 0 (X n x)} 
(n = 1, 2, 3, . . . ) is an orthogonal system with respect to the weight function r such that 
r(x) = x on the interval 0 < x < L. Therefore, 

*L 

xJ 0 (l m x)J 0 (^x) dx = 0 (m = 1, 2, 3,...; n = 1, 2, 3,...; m # n). 

Jo 

Further, if m = n, we have 

"L 

x[j 0 (]„x)] 2 dx = r„ > 0 (n = 1, 2, 3,...). (14.79) 

Jo 

In Section 12.3A we learned how to form a set of orthonormal functions from a set of 
orthogonal characteristic functions of a Sturm- Liouville problem. Applying this 
procedure to the orthogonal set defined by {i 0 (A„x)}, we obtain the corresponding 
orthonormal system {<£„}, where 

</>„(*) = („ = 1,2,3,...), (14.80) 

and T„ (n = 1, 2, 3, . . .) is given by (14.79). Let us now recall the results of Section 12.3B 
concerning the formal expansion of a function / in a series 

oo 

Z C n*fin 
n = 1 


of orthonormal functions {</>„}. According to (12.37) the coefficients c n in the expansion 
of / in the series of orthonormal functions </>„ defined by (14.80) are given by 


1 


c„ = 


" JT. 


xf(x)J 0 {X„x) dx (n = 1,2,3,...). 


Thus this expansion takes the form 


1 f 


--U/LJ o 


x f(x)J 0 (X„x) dx 


1 JoM 

J y/T m ’ 


and we write formally 

/(*)= z 


xf(x)J 0 U„x) dx 


J 0 {X„x), 0 < x < L. 


(14.81) 


Comparing (14.78) and (14.81), we see that if the coefficients A„ in (14.78) are given by 


A„ = 


1 


xf(x)J 0 (X„x) dx ( n = 1, 2, 3, . . .), 


(14.82) 


then the requirement (14.78) will be formally satisfied. We note that the constants T„ in 
(14.82) are given by (14.79). That is, 

*L 

x[J 0 (l„x)] 2 dx ( n = 1, 2, 3, . . .). 


-r 
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This integral can be evaluated in terms of values of the Bessel function of the first kind 
of order one, Indeed, it can be shown that 

£ x[J 0 (A„x)] 2 dx = y IMKL)? (« = 1, 2, 3, . . .), 

and thus 

r„ = y IMKL)V (« = 1,2,3,...). 

Thus the coefficients A n in (14.78) are given by 

A " = L 2 [J t (A W Jo dx ( n = l ’ 2 ’ 3 ’ • ' (14.83) 

Finally, then, we obtain the formal solution of the problem consisting of the partial 
differential equation (14.60) and the conditions (14.61), (14.62), and (14.63). The formal 
solution is 


u(x, 0 = z A nMKx)e x " 2 ', 

11 = 1 

where 

A " = L 2 [J t (A L)] 2 [ dx (n = 12,3,...), 

the X n (n = 1, 2, 3, . . .), are the positive roots of the equation J 0 (AL) = 0, and J 0 and J l 
denote the Bessel functions of the first kind of orders zero and one, respectively. 

Remarks and Observations. At the risk of being unduly repetitious, but with the 
good intentions of promoting a cautious attitude, we emphasize that the results which 
we have obtained are strictly formal results. We assumed “appropriate convergence” of 
the series in (14.77), and we have said nothing concerning the convergence of the Bessel 
function expansion which we obtained for the function f In order to relieve our 
consciences concerning the latter point, we state that there do exist classes of functions 
/ such that the expansions (14.78), in which the coefficients A n are given by (14.83), is 
valid on the interval 0 < x < L. The study of these classes is definitely beyond the scope 
of this book, and we refer the reader to more advanced works for a discussion of this 
and other pertinent problems of convergence. 

Finally, we point out that the problem which we have considered here gives some 
indication of the types of difficulties which may be encountered if the method of 
separation of variables is applied to a problem in which the partial differential equation 
has variable coefficients. The variable coefficient (1/x) in the partial differential 
equation (14.60) led to the variable coefficient (1/x) in the ordinary differential equation 
(14.66) which resulted from the separation of the variables. A similar situation occurs in 
other problems in which the partial differential equation has variable coefficients. In 
such problems, one or more of the ordinary differential equations which result from the 
separation process will also contain variable coefficients. Obtaining the general 
solutions of these ordinary differential equations can then be a formidable task in its 
own right. But even if these general solutions can be obtained, they may involve 
functions which will lead to further difficulties when one attempts to apply certain of 
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the supplementary conditions. This sort of difficulty occurred in the problem of this 
section when we tried to apply the initial condition (14.62). We were forced to consider 
the problem of expanding the function / in a series of Bessel functions, valid on the 
interval 0 < x < L. A similar situation often occurs in other problems which involve a 
partial differential equation with variable coefficients. In such problems one is faced 
with the task of expanding a prescribed function / in a series of nonelementary 
orthonormal functions {</>„}, valid on a certain interval. The set of orthonormal 
functions {</>„} might happen to be one of the many such sets which have been carefully 
studied, or it might turn out to be a set about which little is known. In any case, 
additional difficulties may occur which will necessitate further study and possibly some 
research. 


Exercises 


Use the method of separation of variables to find a formal solution y(x, t) of each of the 
problems stated in Exercises 1-4 

d 2 y d 2 j i 
dx 2 dt 2 ’ 


)>(0, t) = 0, 
y(n, t ) = 0, 
y(x, 0) = sin 

dy(x, Q) _ n 
dt 


0 < t < oo, 

0 < t < oo, 

2x, 0 < x < n, 

0 < x < n. 


d 2 y d^y 
dx 2 dt 2 ’ 


y(0, t) = 0, 0 < t < oo, 

y(3n, t) = 0, 0 < t < oo, 


3 . 


y(x, 0) = 2 sin 

Syix, 0) _ 
dt 

d 2 y d 2 y 
dx 2 dt 2 ’ 


0 < 


0 < x < 3n, 
x < 3n. 


y( 0, t) = 0, 0 < t < oo, 
y( 3, t) = 0, 0 < t < oo, 

x, 0 < x < 1, 
1, 1 < x < 2, 

I 3 — x, 2 < x < 3, 


y(x, 0) = 
dy(x, 0) 


dt 


= 0, 0 < x < 3. 
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4. 




y(0, t) = 0, 0 < t < oo, 
y(n, t) = 0, 0 < t < oo, 
y(x, 0) = sin 2 x, 0 < x < 7i, 
dy(x , 0) 

— - — = sin x, 0 < x < n. 
ot 

5. Apply the method of separation of variables to obtain a formal solution u(x, t) of 
the problem which consists of the heat equation 

8 2 u 8u 
dx 2 dt ’ 


the boundary conditions 

u( 0, t) = 0, t > 0, 
u(L, t) = 0, t > 0, 

and the initial condition 

u(x, 0) = /(x), 0 < x < L, 


where L > 0, and / is a specified function of x, 0 < x < L. 

6. Use the method of separation of variables to find a formal solution w(x, y) of the 
problem which consists of the partial differential equation 


where A > 0 and a > 


d 2 u du 
dx 2 dy 

0, and the conditions 


Ae~ 


u( 0, y) = 0, y > 0, 

u(L, y) = 0, y > 0, 

u(x, 0) = /(x), 0 < x < L. 

[Hint: Let u(x, y) = t;(x, y) + ^(x), where i// is such that i; satisfies the “homo- 

geneous” equation 


d 2 v dt) 
dx 2 dy 


and the homogeneous boundary conditions 

v(0, y) = 0, i;(JU y) = 0.] 

7. Use the method of separation of variables to find a formal solution u(x, y) of the 
problem which consists of Laplace’s equation 


d 2 u d 2 u 
dx 2 + dy 2 
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and the boundary conditions 


«( 0, y) = 0, 

0 < y < n, 

u(n, y) = 0, 

0 < y < n, 

u(x, 7t) — 0, 

0 < x < n. 

u(x, 0) = f(x). 

0 < x < n, 


where / is a specified function of x, 0 < x < n. 

8. Use the method of separation of variables to obtain a formal solution u(r, 0) of the 
problem which consists of the equation 

8 2 u 1 du 1 d 2 u 

dr 2 + r dr + r 2 SO 2 

the periodicity condition 

u(r , 9) = u(r , 9 + 2n), 0 < r < L, for all values of 9 , 

and the boundary conditions 

u( 0, 9) = a, where a is finite, 

u(L , 9) = f ( 9 ), where / is a prescribed function of 9 , for all values of 9. 


14.3 CANONICAL FORMS OF SECOND-ORDER LINEAR EQUATIONS 
WITH CONSTANT COEFFICIENTS 


A. Canonical Forms 


In this section we restrict our attention to second-order linear partial differential 
equations of the form 


d 2 u d 2 u 

dTdj 


_ d 2 u ^ du „ du 
+ C — j + D - — h E — + Fu — 0, 
dy dx dy 


(14.84) 


in which the coefficients A , B , C, D, £, and F are real constants . This equation is a special 
case of the more general equation (14.6) in which these coefficients are functions of x 
and y. In Section 14.1 we classified such equations according to the sign of B 2 — AAC, 
Using this classification, Equation (14.84) is said to be 


1. hyperbolic if B 2 — AAC > 0; 

2. parabolic if B 2 — AAC = 0; 

3. elliptic if B 2 — AAC < 0. 


We shall now show that in each of these three cases Equation (14.84) can be reduced 
to a more simple form by a suitable change of the independent variables. The simpler 
forms which result in this way are called canonical forms of Equation (14.84). We 
therefore introduce new independent variables £, rj by means of the transformation 

£ = £(x, y), rj = rj(x , y). (14.85) 
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We now compute derivatives of u , regarding 5 and rj as intermediate variables, so that 
u = u(5, rj ), where 5 and rj are given by (14.85). We first find 


and 


du _ du di l du drj 
dx 35 dx + drj dx 


du _ du d 5 du drj 
dy di l dy + drj dy * 


Using (14.86), we next determine 


d 2 u 
dx 2 ’ 


We find 


d 2 u 

dx 2 


d ( du 


dx \dx y 

du d (d£\ d£, d (du\ du d ( drj\ drj d 


35 dx \3x ) ^ dx dx \d£, ) ' drj dx \dx J ' dx dx 


Since 5 = 5(x, y) and rj = rj(x , y), 


d (dt 


and 


dx \dx 

d /drj 
dx \3x 


— is simply 


dx 2 


is simply 


d 2 rj 
dx 2 ’ 


(14.86) 


(14.87) 


(14.88) 


However, in computing 


d (du 


dx V35 


and 


d (du 


, the situation is somewhat more 


- n-/ dx \ dr 0 

complicated and we must be careful to remember what is involved. We are regarding u 

as a function of 5 and rj , where 5 and rj are themselves functions of x and y. That is, 

du du 

u = u(5, rj), where 5 = 5(x, y) and rj = rj(x , y). Thus — and — are regarded as func- 

OQ drj 

du 

tions of 5 and rj , where 5 and rj are themselves functions of x and y. That is, — = 1^(5, rj) 

d 5 

du 

and — = u 2 (& */)> where in each case £ = £(x, y) and = t/(x, y). With this in mind, 
drj 


we compute 


d (du 


dx V35 


. We have 


3 / 3u\ _ 3 St/! 35 du i drj 

3x\35/ dx Ul ’ ^ 35 3x drj dx 5 


and since we thus find 


3 / 3u\ 


3 2 m 3^ 3 2 u drj 

35 2 dx + drj 35 3x’ 
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In like manner, we obtain 


d f 3iA d 2 u dd d 2 u drj 
dx \drj) dd drj dx + drj 2 dx' 

Substituting these results into (14.88), we thus obtain 

d 2 u dud 2 d dd[d 2 udd d 2 u drj 


dx \dd 2 dx drj dd dx 


du d 2 rj drj / d 2 u dd d 2 u drj 
+ drj dx 2 + dx drj dx + drj 2 dx 


Assuming u(d, rj) has continuous second derivatives with respect to d and rj , the so- 
called cross derivatives are equal, and we have 


d 2 u d 2 u /d£\ 2 
dx 2 dd 2 l dx ) + 


d 2 u dd drj d 2 ufdrj N 
dd drj dx dx + drj 2 \dx, 


In like manner, we find 


du d 2 d du d 2 rj 
+ dd dx 2 + drj dx 2 ' 


d 2 u dd dd d 2 u f dd drj dd drj 






dx dy dd 2 dx dy dd drj [ _dx dy dy dx J 

d 2 u drj drj du d 2 d du d 2 rj 
+ drj 2 dx dy + dd dx dy + drj dx dy 


(14.89) 


(14.90) 


d 2 u d 2 u f df^ 
dy 2 dd 2 \dyj 


d 2 u dd drj d 2 u fdrj\ 2 
+ dd drj dy dy drj 2 l dy ) 


du d 2 d du d 2 rj 
+ dd dy 2 + drj dy 2 ' 


(14.91) 


We now substitute (14.86), (14.87), (14.89), (14.90), and (14.91) into the partial 
differential equation (14.84), to obtain 


+2 


d 2 u drj d 2 u ( dtj 


+ B \_Wdx 8y + 


dd drj dx dx drj 2 \dx J c 

dd dd d 2 u / dd drj dd drj\ 
dx dy + dd, drj dy + dy dx J + 

/ 3£\ 2 d 2 u dd, drj d 2 u f drj\ 2 du d 2 d du d 2 rj 

dd drj dy dy + drj 2 l dy + ^ ^ ^ * 


drj V du d 2 d du d 2 rj 
dx) + dd dx 2 drj dx 2 


dd drj dd drj\ d 2 u drj drj du d 2 d du d 2 rj 
dx dy + dy dx ) + drj 2 dx dy dd dx dy drj dx dy 


jS 

_dd dx drj dx 


] +£ t 


du d£ du drf I 

dtdy + tydy + U = 0 ' 


Rearranging terms, this becomes 


+ B^ + C + 2 A 



dd drj dd drj 
dx dy dy dx 
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+ 2C 


d t, drj 
dy dy 


d 2 u 


dZ drj 


+ 


Af dr i y , R dr i dt i , c ( dr, ' r 
M ai) + B *^ + C W 


d 2 u 

drj 2 


{ 


j. | a 4. b J^1_ c 1 d — + E — 
+ ' dx 2+ dxdy + dy 2+ dx + dy 


du 

drj 


+ Fu = 0. 


Thus using the transformation (14.85), the equation (14.84) is reduced to the form 

(14.92) 


. d 2 u d 2 u ^ d 2 u ^ du _ du 

Al W + l Wdj + l W + l dZ + l hl + lU= ' 


where 


Ay = A 


B l = 2 A 
Cy = A 


dt\ 2 n dtdt „ 

+ B irir + c 

ox dy 


dx 

d^dri_ 

dx dx 

dt] ^ 2 
dx 


d t> 2 

dy: 


. d£ drj dt, drj . ^ 

+ B00 + 00 +2C 


<3x <3y dy dx 

drj^ 2 


,dZdr[ 

dy dy ’ 


_ dri drj 

+ B^-^ + C 
dx dy 


dy 


(14.93) 


d 2 t d 2 t d 2 t dt dt 


dx 2 ' ~ dx dy 


d 2 rj 

dx 2 


Ei-A^+B—.+ Cj^ + D^ + E^, 


, d 2y 1 

dy 2 


drj_ 

dx 


drj_ 

dy 


and 


F l = F. 


We now show that the new equation (14.92) can be simplified by a suitable choice of 
£(x, y) and rj(x , y) in the transformation (14.85). The choice of these functions £ and rj 
and the form of the resulting simplified equation (the canonical form) depend upon 
whether the original partial differential equation (14.84) is hyperbolic, parabolic, or 
elliptic. 


B. The Hyperbolic Equation 

Concerning the canonical form in the hyperbolic case, we state and prove the following 
theorem. 


THEOREM 14.3 


Hypothesis. Consider the second-order linear partial differential equation 

A d 2 u „ d 2 u ^ d 2 u ^ du du 
A — j + B t — h C — 2 + F >- — h £ — + Fu = 0, 
dx z dx dy dy z dx dy 


(14.84) 
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where the coefficients A, B, C, D, E, and F are real constants and B 2 — 4 AC > 0 so that 
the equation is hyperbolic. 


Conclusion. There exists a transformation 

Z = Z(x, y), n = t](x, y) 


(14.85) 


of the independent variables in (14.84) so that the transformed equation in the independent 
variables (£, tj) may be written in the canonical form 


d 2 u ,du du 
77 a = d-rz + e — + fu, 
d£ drj dt] 


(14.94) 


where d, e, and f are real constants. 

If A ^ 0, such a transformation is given by 


Z = X v x + y, 
t] = X 2 x + y, 

where A, and X 2 are the roots of the quadratic equation 

AX 2 + BX + C = 0. 

If A = 0, B / 0, C / 0, such a transformation is given by 


(14.95) 


(14.96) 


Z = x, 

B 

r=x-- y . 


(14.97) 


// A = 0, B # 0, C = 0, such a transformation is merely the identity transformation 

i = x, 

(14.98) 

rj = y. 


Proof. We shall first show that the transformations given by (14.95), (14.97), and 
(14.98) actually do reduce Equation (14.84) so that it may be written in the form (14.94) 
in the three respective cases described in the conclusion. We shall then observe that 
these three cases cover all possibilities for the hyperbolic equation, thereby completing 
the proof. 

We have seen that a transformation of the form (14.85) reduces Equation (14.84) to 
the form (14.92), where the coefficients are given by (14.93). In the case A ^ 0, we apply 
the special case of (14.85) given by 


z = X t x + y. 


t] = X 2 x + y, 


(14.95) 


where X t and X 2 are the roots of the quadratic equation 

AX 2 + BX + C = 0. (14.96) 


Then the coefficients in the transformed equation (14.92) are given by (14.93), where 
£(x, y) and tj(x, y) are given by (14.95). Evaluating these coefficients in this case we find 
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that 


A i — A A 2 + BA± + C, 

B t = 2AA t A 2 4- B(A t 4- A 2 ) 4- 2 C 



C x = AA\ 4- BA 2 4- C, 
D t = DA± 4 ” E , 

E l = DA 2 4- E , 
F 1 = F. 


Since and i 2 satisfy the quadratic equation (14.96), we see that /4 X = 0 and = 0. 
Therefore in this case the transformed equation (14.92) is 



|^ + (W 1+ £,| + , Wj + £)| + fU -0. 


(14.99) 


Since B 2 — 4AC > 0 [Equation (14.84) is hyperbolic], the roots A t and A 2 of (14.96) are 
real and distinct. Therefore the coefficients in (14.99) are all real. Furthermore, the 
leading coefficient (B 2 — 4AC)/( — A) is unequal to zero. Therefore we may write the 
transformed equation (14.99) in the form 


where the coefficients 


d 2 u flu du 

W^, = d K + e di + fu ’ 


A(Dl l + E) _ A(D1 2 + E) 
B 2 - 4/4 C ’ e _ B 2 - 4AC ’ 


/ = 


AF 

B 2 - 4AC 


are all real. This is the canonical form (14.94). 

Now consider the case in which A = 0, B # 0, C # 0, and apply the special case of 
(14.85) given by 


t = x 9 

B 

' = x ~c y ■ 


(14.97) 


In this case the coefficients in the transformed equation (14.92) are given by (14.93), 
where £(x, y) and rj(x , y) are given by (14.97). Evaluating these coefficients (recall A = 0 
here), we find that 

Ai = 0 , 

B 2 

Bl = --c* 0’ 

EB 

D\ = D, E l = D — , 
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Therefore in this case the transformed equation is 

r DC - EB\ du 

: 


B 2 \ d 2 u du ( l 


+ Fu = 0. 


Since ( — B 2 )/C / 0, we may write this in the form 


d 2 u du du 


dtdn di dti 


where the coefficients 


d = 


CD 

B r ’ 


DC - EB 


e = - 


B 


f = — 

J t> 2 


CF 

B 2 


are all real. This is again the canonical form (14.94). 

Finally, consider the case in which A = 0, B ^ 0, C = 0. In this case Equation (14.84) 
is simply 

_ d 2 u n du „ du 

B — — - — I- D — — 1 £- — h Fu = 0, 
dx dy dx dy 

and the identity transformation (14.98) reduces it to 

_ d 2 u „ du „du 

dl;dri dt drj ^ 

Since B # 0, we may write this in the form 

d 2 u du du 

~ d TT + e -£- + fu, 


dt; drj dt; drj 


where 


d = 


D 


e = — 


B ’ 


f = 


B 


This is again the canonical form (14.94). In effect, in this special case (A = 0, B # 0, 
C = 0) Equation (14.84) may be put in the canonical form (14.94) simply by trans- 
posing the appropriate terms and dividing by B. 

We now observe that the three special cases considered exhaust all possibilities for 
the hyperbolic equation (14.84). We first note that either A = 0 or A # 0. All cases in 
which A # 0 are covered by the first of the three special cases which we have 
considered. Turning to the cases in which A = 0, it would appear that the following four 
distinct possibilities deserve consideration: (a) B # 0, C # 0; (b) B # 0, C = 0; (c) B = 0, 
C # 0; and (d) B = 0, C = 0. We note that (a) and (b) are covered by the second 
and third of the three special cases which we have considered. Concerning (c) and 
(d), in both cases B 2 - 4 AC = 0, contrary to hypothesis. In particular, if (c) holds, 
Equation (14.84) is parabolic (not hyperbolic); and if (d) holds, Equation (14.84) is of 
the first order. 

We thus observe that the three special cases considered cover all possibilities for the 
hyperbolic equation (14.84). Thus there always exists a transformation (14.85) which 
transforms the hyperbolic equation (14.84) into one which may be written in the 
canonical form (14.94). Q.E.D 
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► Example 1 4.6 


Consider the equation 

d 2 u d 2 u d 2 u du du 
dx 2 + dx dy dy 2 + dx + dy 


9u = 0. 


(14.100) 


We first observe that B 2 — 4,4 C = 36 > 0 and so Equation (14.100) is hyperbolic. Since 
A # 0, we consider the transformation 


£ = A t x + y, 


tj = A 2 x + y. 


(14.95) 


where A, and A 2 are the roots of the quadratic equation A 2 + 4A — 5 = 0. We find that 
A t = 1 and A 2 = —5, and so the transformation (14.95) is 


I = * + y, 

y] = - 5x + y. 


(14.101) 


Applying (14.101) to Equation (14.100), we see that this equation transforms into 


,, 8 2 u du du 

- 36 ^ + 9 ar 27 ^- 9 “ = 0 ' 


Dividing by —36 and transposing terms, we obtain the canonical form 

d 2 u 1 du 3 du 1 
dtj = 4 ^" 4 ^" 4 U - 


C. The Parabolic Equation 

We now investigate the canonical form in the parabolic case and obtain the following 
theorem. 


THEOREM 14.4 


Hypothesis. Consider the second-order linear partial differential equation 


d 2 u 


d 2 u 


A dx 2 + B dx dy 


, d 2 u 


du 


du 


H- C — 2 + D — — h E — — h Fu — 0, 
dy dx dy 


(14.84) 


where the coefficients A, B, C, D , E , and F are real constants and B 2 — 4 AC = 0 so that 
the equation is parabolic. 

Conclusion. There exists a transformation 

£ = £(*, yl n = rj(x, y) (14.85) 

of the independent variables in (14.84) so that the transformed equation in the 



14.3 CANONICAL FORMS 751 


independent variables (£, rj) may be written in the canonical form 



d 2 u du du 

(14.102) 

where d , e , and / are rea/ constants. 

If A # 0 and C # 0, such a transformation is given by 



£ = Xx + y, 

n = y. 

(14.103) 

where X is the repeated real root of the quadratic equation 



AX 2 + BX + C = 0. 

(14.104) 

If A ^ 0 and C = 0, such a 

transformation is given by 



H 

II II 

(14.105) 


If A = 0 and C # 0, such a transformation is merely the identity transformation 

Z = x, 
rj = y . 


(14.106) 


Proof. We shall proceed in a manner similar to that by which we proved Theorem 
14.3. 

If A # 0 and C # 0, we apply the transformation (14.103) to obtain the transformed 
equation (14.92) with coefficients (14.93), where in this case f (x, y) and rj(x , y) are given 
by (14.103). Evaluating these coefficients we find that 

A t = AX 2 + BX + C, 

B i — BX + 2C, 

c 1 = c, 

= DX + £, = E, F l = F . 


Since A satisfies the quadratic equation (14.104), we see at once that A i = 0. Also, 
since B 2 - 4 AC = 0, X = —B/2A and so 


*1 


B 2 

2A 


+ 


2C = 


4AC - B 2 
2A 


= 0 . 


Thus in the present case the transformed equation (14.92) is 


8 2 u du du 

C^ + (DA + £)- + £- + Fu = 0. 
drj z oq drj 


(14.107) 


Since X is real, all coefficients in (14.107) are real; and since C # 0, we may write 
equation (14.107) in the form 
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where the coefficients 


d 


D1 + E 
C 



f=~ 


F 

C 


are all real. This is the canonical form (14.102). 

If A # 0 and C = 0, we apply the transformation (14.105) to obtain the transformed 
equation (14.92) with coefficients (14.93), where in this case g(x, y) and rj(x, y) are given 
by (14.105). Evaluating these coefficients, we obtain 


A t = C = 0, 

B i = B = 0 (since B 2 — AAC = 0 and C = 0), 

Ci = A ^ 0, 

D i = E, E { = D, F t = F. 

Thus in the case under consideration the transformed equation (14.92) is 

, d 2 u ^ du ^ du 
A — 2 + E — + D - — Fu = 0. 
drj d£ drj 

Since A ^ 0, we may write this in the form 

d 2 u du du 

W = d dt + e di + fu ’ 


where the coefficients d = — E/A, e = - D/A, and / = — F/A are all real. This is again 
the canonical form (14.102). 

Finally, consider the case in which A = 0 and C # 0. Since B 2 - AAC = 0, we must 
also have B = 0. Therefore in this case Equation (14.84) is simply 



+ D 


du 

dx 



“b Fu — 0, 


and the identity transformation (14.106) reduces it to 



+ D 


du _ du 
d£ drj 


+ Fu = 0. 


Since C # 0, we may write this in the form 


d 2 u du 

d^ = dl 


du 

+ e + 

drj 


fu, 


whered= —D/C,e= —E/C,f = —F/C. This is again the canonical form (14.102). We 
thus see that in this special case (A = 0, B = 0, C # 0) Equation (14.84) may be put in 
the canonical form (14.102) simply by transposing the appropriate terms and dividing 
by C. 

Finally, we observe that the three special cases considered exhaust all possibilities for 
the parabolic equation (14.84). For, either A = 0 or A # 0. If A # 0, either C # 0 or 
C = 0. These two possibilities are, respectively, the first and second special cases 
considered, and so all cases in which A ^ 0 are thus covered. If A = 0, either C # 0 or 
C = 0. The first of these two possibilities is the third special case considered. Finally, 
consider the situation in which A = 0 and C = 0. Since B 2 — AAC = 0, we must also 
have B = 0 and so Equation (14.84) reduces to a first-order equation. 
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Therefore there always exists a transformation (14.85) which transforms the 
parabolic equation (14.84) into one which may be written in the canonical form 
(14.102). Q.E.D . 


► Example 14.7 


Consider the equation 


d 2 u d 2 u d 2 u du du 
dx 2 dx dy + dy 2 "** dx dy 


u = 0. 


(14.108) 


We observe that B 2 — 4AC = 0, and so Equation (14.108) is parabolic. Since A # 0 
and C # 0, we consider the transformation 


£ = kx + y, 

n = y. 


(14.103) 


where X is the repeated real root of the quadratic equation X 2 — 6X + 9 = 0. We find 
that 1 = 3, and so the transformation (14.103) is 


£ = 3* + y, 
rj = y. 


(14.109) 


Applying (14.109) to Equation (14.108), we see that this equation transforms into 

. d 2 u du . du 

9 J? + 9 1T( + , T,- u = 0 - 

Dividing by 9 and transposing terms, we obtain the canonical form 

d 2 u du 1 du t 1 
Htf = ~di~ 3 ^ + 9 U ' 


D. The Elliptic Equation 

Finally, we prove the following theorem concerning the canonical form in the elliptic 
case. 


THEOREM 14.5 


Hypothesis. Consider the second-order linear partial differential equation 


d 2 u d 2 u 
dx 2 + dx dy 


„ d 2 u ^ du ^ du 
+ C — 2 + D — — h E - — h Fu = 0, 
dy L dx dy 


(14.84) 


where the coefficients A , B , C, D, E, and F are real constants and B 2 — 4 AC < 0 so that 
the equation is elliptic. 


Conclusion. There exists a transformation 

£ = £(x, y) 9 rj = rj(x , y) 


(14.85) 
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of the independent variables in (14.84) so that the transformed equation in the independent 
variables (£, rj) may be written in the canonical form 


d 2 u d 2 u du du 

W + W =d d£ + e hl + fU ’ 


(14.110) 


where d , e , and f are real constants. 
Such a transformation is given by 


£ = ax + y, 
rj = bx. 


(14.111) 


where a ± bi (a and b real , b / 0) are the conjugate complex roots of the quadratic 
equation 


AX 2 + BX + C = 0 


(14.112) 


Proof. Since B 2 - 4 AC < 0, we cannot have A = 0 in the elliptic case. Thus 
Equation (14.112) is a full-fledged quadratic equation with two roots. The condition 
B 2 — 4AC < 0 further shows that these two roots must indeed be conjugate complex. 

We apply the transformation (14.111) to obtain the transformed equation (14.92) 
with coefficients (14.93), where in this case £(x, y) and q(x 9 y) are given by (14.111). 
Evaluating these coefficients we find that 

A i = Aa 2 -+- i 3a + C, 

B t = 2Aab + Bb = b(2Aa + B) 

= Ab 2 / 0 (since /I/O, 6/0) 

D l = Da -f E, E x = Db , F t = F. 

Since a + bi satisfies the quadratic equation (14.112), we have 

A(a + bi) 2 + B(a + bi) + C = 0 

or 

[A(a 2 - b 2 ) + Ba + C] + [6(2/4a + B)]i = 0. 

Therefore 

4(a 2 - 6 2 ) + Ba + C = 0 
and 


6(2/4a + B) = 0. 


Thus 


i4^ = /Ifl 2 -b Ba -f C = /16 2 


and 


^ = 0 . 

Hence the transformed equation (14.92) is 


Ab 2 


f d 2 u d 2 u\ 
V^ 2 + dr] 2 ) 


+ {Da + E) + Db + Fu = 0. 

or] 


(14.113) 
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Since a and b are real, all coefficients in (14.1 13) are real; and since Ab 2 # 0, we may 
write Equation (14.1 13) in the form 


where the coefficients 


d 2 u d 2 u du du 
d£ 2 + drj 2 + 6 drj 


+ M 


d = 


Da + E 



are all real. This is the canonical form (14.1 10). 



Q.E.D. 


► Example 14.8 


Consider the equation 


d 2 u 8 2 u 8 2 u 

dx 2 + dx dy + dy 2 




-3u = 0. 


(14.114) 


We observe that B 2 - 4AC = — 16 < 0 and so Equation (14.114) is elliptic. We 
consider the transformation 


£ = ax + y, 
n = bx, 


(14.111) 


where a ± bi are the conjugate complex roots of the quadratic equation X 2 + 2X + 
5 = 0. We find that these roots are — 1 ± 2 i, and so the transformation (14.111) is 


£ = -x + y, 
Y] — 2x. 


(14.115) 


Applying (14.1 15) to Equation (14.1 14), we see that this equation transforms into 
. d 2 u A 8 2 u _ du ^ du 

4 W + 4 W~ 3 H + 2 dil~ 3U = 0 ' 

Dividing by 4 and transposing terms, we obtain the canonical form 


d 2 u d 2 u 3 du l du 3 

W + W = 4 ^ _ 2 ^ + 4 U ‘ 


E. Summary 


Summarizing, we list in Table 14.1 the canonical forms which we have obtained for the 
second-order linear partial differential equation 


d 2 u d 2 u 
d^+ -fody 


„ d 2 u _ du 

+ C — ^ A- D - — h E — - -f Eu = 0, 
dy~ dx dy 


where A, B , C, D, E, and F are real constants. 


(14.84) 
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TABLE 14.1 

Type of 

equation (14.84) 

Canonical form 

(where d , e , and f are real constants) 

hyperbolic: 

B 2 - 4 AC > 0 

d 2 u du du 

... = d + e + fu 
d^dq di \ dt] 

parabolic: 

B 2 - 4AC = 0 

d 2 u .du du 

2 = d-- + e—- + fu 
drj z d£ drj 

elliptic: 

B 2 - 4 AC < 0 

d 2 u d 2 u du du 

W + W = d M + e ei, + fu 


Exercises 


Transform each of the partial differential equations in Exercises 1-10 into canonical 
form. 


1. 


2 . 


d 2 u ^ 
dx 2 dx dy 


d 2 u 8 2 u 

+ v a 


d 2 u d 2 u 
dx 2 dx dy 

d 2 u ^ d 2u 
dx 2 dx dy 


A d 2tl A 

+ 4 5? = a 

. d 2 u n du 

~ *df 2+ = 


4 . 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 


d 2 u d 2 u 
dx 2 + dx dy 




d 2 u 

dx 2 dx dy 


d 2 u i ~d 2 u du 

+, v- 9 «r a 


d 2 u d 2 u 
dx 2 + dx dy 


d 2 u _ du 

+ t j “h 3 t — h 9w = 0. 
dy 21 


. 3 2 u d 2 u du 


d 2 u 

dx 2 + dx dy 


d 2 u .du c du 

* 3 r x + 5 e-y-°- 


d 2 u d 2 u 
dx 2 dx dy 


_ d 2 u 

+ 5 — 2 + u = 0. 
dy 2 - 


d 2 u 

dx 2 


d 2 u 


_ dudu 

dxdy + 5 d^~Ty + 3u = °- 


Show that the transformation 


Z = y- 


2 ’ 


n = x. 
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reduces the equation 


d 2 u d 2 u 

dx 2 + X dxdy 


+ x z 


8 2 u 

dy 2 


= 0 


d 2 u du 
d^ = dt' 


12. Consider the equation 


d 2 \i d 2 u d 2 u 

dS +{2x + 3) fcdj + 6 x W~ 


(A) 


(a) Show that for x = f , Equation (A) reduces to a parabolic equation, and 
reduce this parabolic equation to canonical form. 

(b) Show that for x # f , the transformation 

{ = y- 3x, 

rj=y-x 2 

reduces Equation (A) to 

2 — 

d 2 u drj 

dtfoj = 4(ri-Z)-9' 


14.4 AN INITIAL-VALUE PROBLEM; CHARACTERISTICS 
A. An Initial-Value Problem 


We shall now consider an initial-value problem for the second-order linear partial 
differential equation 


d 2 \i d 2 ^ 
dx 2 + dx dy + 



+ D 


du „ du 
te + E Ty 


+ Fu = 0, 


(14.84) 


where A, B, C, D , E, and F are real constants. In Chapter 4 we considered an initial- 
value problem for linear ordinary differential equations. Let us begin by recalling this 
problem for the second-order homogeneous linear ordinary differential equation with 
constant coefficients, 

d 2 v dv 

a + b-j- + cy = 0. (14.116) 

dx dx 


The problem is to find a solution / of (14.116) such that /(x 0 ) = c 0 and /'(x 0 ) = c l9 
where x 0 is some definite real number and c 0 and are arbitrary real constants. 
Interpreting this geometrically, the problem is to find a solution curve y = f(x) of 
(14.1 16) which passes through the point (x 0 , c 0 ) and whose tangent line has the slope c t 
at this point. 

Let us now attempt to formulate an analogous initial-value problem for the partial 
differential equation (14.84). In doing this we shall let ourselves be guided by the 
geometric interpretation of the initial-value problem for the ordinary differential 
equation (14.1 16). In this problem we must find a solution of Equation (14.1 16) which 
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satisfies certain supplementary conditions (the initial conditions). A solution of 
Equation (14.116) is a function / of the one real variable x, and this defines a curve 
y = f(x) (a solution curve) in the xy plane. In the analogous problem for the partial 
differential equation (14.84), we shall also seek a solution which satisfies certain 
supplementary conditions. But a solution of Equation (14.84) is a function 0 of the two 
real variables x and y, and this defines a surface u = 0(x, y) (a solution surface) in three- 
dimensional x, y, u space. Geometrically speaking, then, for the partial differential 
equation (14.84) we must find a particular solution surface u = </>(x, y) in x, y, u space. 
Now in the initial-value problem for the ordinary differential equation (14.116), there 
are two supplementary requirements which the solution curve y = /(x) in the xy plane 
must satisfy. First, the solution curve y = /(x) must pass through a prescribed point 
(x 0 , c 0 ) in the xy plane; and, second, this solution curve must be such that its tangent 
line has a prescribed slope c x at this point. What sort of analogous requirements might 
be imposed in an initial- value problem for the partial differential equation (14.84)? 
Instead of prescribing a point in the xy plane through which the solution curve must 
pass, we would prescribe a curve in x, y, u space through which the solution surface must 
pass. Instead of prescribing a slope for the tangent line to the solution curve at the 
prescribed point, we would prescribe a normal direction for the tangent plane to the 
solution surface along the prescribed curve. We thus formulate an initial-value problem 
for the partial differential equation (14.84) in the following geometric language. 

We seek a solution surface u = 0(x, y) of the partial differential equation (14.84) 
which (a) passes through a prescribed curve T (the initial curve) in x, y, u space, and (b) is 
such that its tangent plane has a prescribed normal direction at all points of the initial 
curve T. 

Let us now proceed to formulate this initial- value problem analytically. To do so, let 
us assume that the prescribed curve T has the parametric representation 

x = x(t), y = y(f), u = u(t) 

for all t on some real interval /. Now let t 0 e I and consider the corresponding point on 
T. This point has coordinates (x 0 , y 0 , u 0 ), where x 0 = x(t 0 ), )>o = y(t o)> u o = w(t 0 )- If 
the solution surface u = </>(x, y) is to pass through T at (x 0 , y 0 , w 0 ), then we must have 
u 0 = 4>(x 0 , y 0 ) or, equivalently, u(t 0 ) = 0[x(f o ), y(t 0 )]. Therefore the requirement that 
the solution surface u = </>(x, y) pass through the entire curve T is expressed by the 
condition 


u(t) = <£[x(f),y(t)] (14.117) 

for all t e /. 

We now consider the requirement that the tangent plane to the solution surface 
u = 0(x, y) have a prescribed normal direction at all points of T. Let us assume that 
this prescribed normal direction is given by [p(t), q(t ), - 1] for all t e /. Recalling that 
the normal direction to the tangent plane to u = 0(x, y) is given by 

I4> x (x, y), 4>y(x, y), - U* 


* Here and throughout the remainder of the chapter it is convenient to employ subscript notation for the 
various partial derivatives of <f>. Thus we denote 

d(f> d(f> d 2 4> 

- by K - by 4, y , — by 


d 2 <t> 

dydx 


by <f> xy , 


and 


d 2 d> 

V 


by <t>yy 
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we see that the requirement under consideration is expressed by the conditions that 

p(t) = y(t)l 

q(t) = 4> y [x(t), y(t)], 

for all t e /. 


(14.118) 


We have thus far said nothing concerning the nature of the functions x, y, u which 
define T or the functions p, q which prescribe the normal direction. Let us assume that 
each of these five functions is analytic for all t e I. We must now observe that the 
functions p and q cannot be chosen arbitrarily. For, from (14.1 17) we must have 

(0, l'(t)] + 4>ylx(t), l'(t)] 

for all t g /; and since (14.1 18) must hold for all t e /, this reduces to 


pU) d 41 +q(t) d yu 


dt 


dt 


dt 


(14.119) 


for all t g /. Thus the functions p and q are not arbitrary; rather they must satisfy the 
identity (14.119). 

In our attempt to formulate the initial-value problem analytically, we have thus 
introduced a set of five functions defined by {x(r), y(t), u{t ), p(t), q(t )}, where (1) x = 
x(t), y = y(t), u = u(t) is the analytic representation of a prescribed curve T in x, y , u 
space, (2) [p(t), q(t), — 1] is the analytic representation of a prescribed normal direction 
along T, and (3) these five functions satisfy the identity (14.119). Such a set of five 
functions is called a strip , and the identity (14.1 19) is called the strip condition. 

We may now state the initial-value problem in the following way. 


Initial-Value Problem. 

equation 

8 2 u 


Consider the second-order linear partial differential 


A~ R ^ 2yL 

dx 2 + dx dy 


_ d 2 u ^du ^8u 

-f C — 2 + D — — f- E — — h Fu = 0, 
dy ox dy 


(14.84) 


where A, B, C, D, £, and F are real constants. Let x(t), y(t ), u(t ), p(t), and q(t) denote five 
real functions, each of which is analytic for all t on a real interval /. Further, let these 
five functions be such that the condition 




(14.119) 


holds for all t e I. Let R be a region of the xy plane such that [x(t), y(f )] e R for all t e /. 

We seek a solution 4>(x, y) for the partial differential equation (14.84), defined for all 
(x, y) e R, such that 

0O(t), J'(t)] = u(t), 

and 

</> x [x(t), y(t)] = p(t), 

4> y lx(t), J'(t)] = q(t), 

for all t e I. 


(14.117) 

(14.118) 
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► Example 14.9 


Consider the partial differential equation 


d 2 u d 2 u d 2 u 

dx 2 + dx dy dy 2 


and the five real functions defined by 


x(t) = t , y(t) = t 2 , u(t) = t 4 + 2 t 2 , p(t) = 4 t, q(t) = It 2 


(14.120) 


(14.121) 


for all real t. We observe that these five functions satisfy the strip condition (14.119); 
indeed we have 


At 3 + 4r = (4t)(l) + (2t 2 )(2t) 

for all real t. We may therefore interpret the set of five functions defined by (14.121) as a 
strip and state the following initial-value problem. 

We seek a solution </>(x, y) of the partial differential equation (14.120), defined for all 
(x, y), such that 

0(r, t 2 ) = t 4 + It 2 , 

( f )x (t,t 2 ) = 4t, (14.122) 

cj) y {t, t 2 ) = It 2 , 

for all real t. 

Geometrically, we seek a solution surface u = 0(x, y) of the partial differential 
equation (14.120) which (a) passes through the space curve T defined by 

x = t, y = t 2 , u = t 4 + It 2 

for all real t, and (b) is such that its tangent plane has the normal direction {At, It 2 , - 1) 
at all points of T. 

We now observe that a solution of this problem is the function </> defined by 

(j)(x, y) = lx 2 + y 2 

for all (x, v). In the first place, this function (p satisfies the partial differential equation 
(14.120). For we find that 

4>x*{x, y) = 4, 4>y X (x, y) = 0, and <p yy (x, y) = 2, 

so that 


<t>xx(x, y ) + 4>yx(x, y) - 2 4> yy (x, y) = 0 

for all (x, y). In the second place, the conditions (14.122) are satisfied. Clearly 
cj){t, t 2 ) = It 2 -f t 4 for al) real t. From </> x (x, y) = 4x, we see that 0 x (t, t 2 ) = At for all 
real V, and from 0 y (x, y) = 2y, we see that cf) y (t , t 2 ) = It 2 for all real t. 

We shall now examine the initial-value problem from a slightly different viewpoint, 
again employing useful geometric terminology in our discussion. Suppose, as before, 
that the curve T through which the solution surface u = </>(x, y) must pass is defined by 

x = x{t), y = y(t), u = u(t) (14.123) 

for all t on some real interval /. Then the projection of T on the xy plane is the curve T 0 
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defined by 

x = x(t), y = y(t) (14.124) 

for all t e /. Employing this curve r 0 , we may express the initial-value problem in the 
following form. 

We seek a solution <j>(x, y) of the partial differential equation (14.84) such that 0, 0 X , 
and (f)y assume prescribed values given by u(t), p(t), and q(t ), respectively, for each t e J, 
or, in other words, at each point of r 0 [where we assume that the five functions given 
by x(t), y(t), u(f), p(t), and q(t) satisfy the strip condition (14.119)]. 

We shall find that this interpretation of the initial-value problem will be useful in the 
discussion which follows. 


B. Characteristics 

Continuing our discussion of the initial-value problem, let us now assume that the 
problem has a unique solution 0(x, y) defined on a region R (of the xy plane) which 
includes the curve T 0 . Let us also assume that at each point of T 0 the solution </>(x, y) 
has a power series expansion which is valid in some circle about this point. Now let 
t 0 e I and let x 0 = x(t 0 ), y 0 = y(t 0 ), so that (x 0 , y 0 ) is a point of r 0 . Then for all (x, y) 
in some circle K about (x 0 , y 0 ) we have the valid power series expansion 

</>(x, y) = 0(x o , y 0 ) + [0 x (x o , y 0 )(x - x 0 ) + <^(x 0 , y 0 )(y - y 0 ) ] 

+ Y\ J'oK^ - *o) 2 + 2 (t> X y(x 0 , J'oKx - x 0 )(j> - yo) (14.125) 
+ <l>yy(Xo,yo)(y - J'o) 2 ] + 

The solution </>(x, y) will be determined in the circle K if we can determine the 
coefficients 


<t>(*o>yo\ 4>x(xo,y 0 ) 9 <t>y(x 0 ,y 0 ), <t> xx (x 0 ,y <>)>•• • 

in the expansion (14.125). 

Now we already know the first three of these coefficients. For the initial-value 
problem requires that 0, 0 X , and assume prescribed values given by u(t ), p(t), and 
q(t ), respectively, at each point of T 0 . Therefore at the point (x 0 , y 0 ) at which t = t 0 , we 
have 


4>(x 0 ,y 0 ) = w(t 0 ), 
0x(^o ? Lo) = P(to)> 


and 


4> y (x 09 y 0 ) = q(t 0 ). 

Let us now attempt to calculate the next three coefficients in (14.125). That is, let us 
see if we can determine 


<t>xx(x 0 ,yo% <M*o> 3>o), <t>yy(x 09 y 0 ). (14.126) 

To do this, we need to know conditions which these numbers must satisfy. One such 
condition is readily available: it can be obtained at once from the partial differential 
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equation (14.84)! For, since </>(x, y) is a solution of (14.84) in the region R , we have 
Mxx(x, y) + B4> xy (x, y) + C(j) yy (x, y) = - D<p x (x, y) - E<j> y (x, y) - F<p(x, y) 
for all (x, y) e R. Thus, since (x 0 , y 0 ) e R , we have the condition 


^o) “b B(fi x y(x o, 3^o) "1” C&yy{x 0> .Vo) 

= -£<£x(*o> .Vo) - E(j)y{x o, y 0 ) - F(j)(x 0 , y 0 ). (14.127) 

Now along the curve T 0 we know that 


</> x [x(t), y(t)] = p{t) 
and 

</>y[>(0> y(0] = fl(0- 

Differentiating these identities with respect to t, we find that along the curve T 0 we have 

dx dy dp 

4>xxlx(t), ^(t)] + 4>xylx(t), y(t)] -Jt = ~di' 

4>xy[x(t), y(t)] ^ + 4>„lx{t), y(t)] ^ = Yt’ 

Since (x 0 , y 0 ) is a point of T 0 , we thus obtain the two additional conditions 

x'{t 0 )(f> xx {x 0 , y 0 ) + y'(t 0 )4> xy {x o, y 0 ) = p'(fo) (14.128) 

and 


Ato)<l>xy(Xo> yo) + ?(to)<l>yy(Xo, ^o) = 0'(*o)> (14.129) 

i #/ v #/ v #/ , , „ v t , r dx dy dp dq . , 

where x (r 0 ), y (t 0 ), p ( t 0 ), and q (t 0 ) denote the values of and — , respectively, 

dt dt dt dt 

at t = to . 

Let us examine more closely the three conditions (14.127), (14.128), and (14.129). 
They are actually a system of three linear algebraic equations in the three unknown 
numbers (14.126); all other quantities involved are known numbers! A necessary and 
sufficient condition that this system have a unique solution is that the determinant of 
coefficients be unequal to zero (Section 7.5C, Theorem B). Therefore in order to have a 
unique solution for the three coefficients (14.126), we must have 


A 


Mb) = 


x'(t 0 ) 


0 


B 

/(«o) 

Ab) 


C 

0 

/(to) 


*0. 


Conversely, if A(t 0 ) ^ 0,wecan determine the three coefficients (14. 126) from the linear 
system consisting of Equations (14.127), (14.128), and (14.129). 

We have assumed that the solution <p(x, y) has a power series expansion of the form 
(14.125) at each point of the curve r 0 . Thus if the determinant 


A (t) = 


A B 
dx dy 
dt dt 

dx 

It 


C 


dy 

dt 


0 
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at all points of r 0 , we can determine the values of (p xx , 4> xy , and (/) yy all along r 0 . We can 
then attempt to find the values of the higher derivatives of <f>{x, y) along r 0 , employing 
a procedure similar to that by which we found the values of the three second 
derivatives. One can show that if A(t) ^ 0 at all points of r 0 , then the values of these 
higher derivatives can be determined all along r 0 . In this manner we find the 
coefficients in the series expansion of 4>{x, y) along r 0 . 

It can be shown that if A(r) ^ 0 at all points of r 0 , then there exists a unique solution 
of the initial-value problem. On the other hand, if A(r) = 0 at all points of r o , then 
there is either no solution or infinitely many solutions. 

Let us now consider a curve r 0 at all points of which 


A B 


A(t) = 


dx 

It 


0 


dy 

dt 

dx 

li 


C 


dy 

dt 


Expanding this determinant, we thus see that 


A 



B^if + C 

dt dt 



= 0 


at all points of r 0 . Therefore T 0 must be a curve having an equation £(x, y) = c 0 which 
satisfies 

A dy 2 — B dx dy + C dx 2 = 0. (14.130) 


We thus investigate the solutions of Equation (14.130). We shall divide this 
discussion into the following three subcases: (a) A / 0; (b) A = 0, C # 0; and (c) 4 = 0, 
C = 0. 


Subcase (a): A ^ 0. In this case if follows from (14.130) that the curve T 0 is defined 
by y = <t>(x), where (j>(x ) satisfies the ordinary differential equation 


A 




+ C = 


0 . 


From this equation we see that 

dy B + - 4AC dy B — - 44C 

dx 2A ° r dx 2 A 


Thus if A # 0 the curve T 0 must be a member of one of the families of straight lines 
defined by 

[£ + JB 2 -4AC1 H 3- JB 2 - 4AC~ 1 , , 

y = I ~*2A + or ) ;= ~2A x + (14.131) 

where c t and c 2 are arbitrary constants. 


Subcase (b): A = 0, C # 0. In this case it follows from (14.130) that T 0 is defined 
by x = 9(y), where 6(y) satisfies the ordinary differential equation 
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From this equation we see that 

dx B dx 

— = - or — = 0. 

dy C dy 

Thus if A = 0 and C # 0, the curve T 0 must be a member of one of the families of 

straight lines defined by 

x = — y + c t or x = c 2 , (14.132) 

where c t and c 2 are arbitrary constants. 


Subcase (c): A = 0, C = 0. In this case it follows at once from (14.130) that the 
curve r 0 must be a member of one of the families of straight lines defined by 

y = c i or x = c 2 , (14.133) 

where c t and c 2 are arbitrary constants. 

Thus if T 0 is a curve at all points of which A (t) = 0, then T 0 must be one of the 
straight lines defined by ( 1 4. 1 3 1 ),( 1 4. 1 32), or ( 1 4. 1 3 3). Any such curve T 0 in the xy plane 
along which A (t) = 0 is called a characteristic (or characteristic base curve) of the partial 
differential equation (14.84). In other words, a characteristic of (14.84) is any curve T 0 in 
the xy plane having an equation £(x, y) = c 0 which satisfies 

A dy 2 — B dx dy + C dx 2 = 0. (14.130) 

We have shown that the characteristics of the partial differential equation (14.84) are 
straight lines. Specifically, if A / 0, they are the straight lines defined by (14.131); if 
A = 0 and C # 0, they are those defined by (14.132); and if A = 0 and C = 0, they are 
given by (14.133). 

Let us consider the connection between the characteristics of the partial differential 
equation (14.84) and the initial-value problem associated with this equation. Recall that 
in this problem we seek a solution </>(x, y) of (14.84) such that </>, </> x , and 4> y assume 
prescribed values given by u(t ), p{t ), and q(t ), respectively, at each point of a curve T 0 (in 
the xy plane) defined by x = x(t), y = y(t) for all t on some real interval /. 

Further recall that if 


A 

B 

c 

dx 

dy 

0 

dt 

dt 

0 

dx 

dy 

It 

dt 


is unequal to zero at all points of r 0 , then this initial- value problem has a unique 
solution; but if A{t) = 0 at all points of T 0 , then there is either no solution or infinitely 
many solutions. Now we have defined a characteristic of (14.84) as a curve in the xy 
plane along which A (t) = 0. Thus we may say that if the curve r 0 is nowhere tangent to 
a characteristic, then the initial-value problem has a unique solution; but if T 0 is a 
characteristic, then the problem has either no solution or infinitely many solutions. 

We now examine the characteristics of Equation (14.84) in the hyperbolic, parabolic 
and elliptic cases. We shall find it convenient to consider once again the three subcases 
(a) A # 0, (b) A = 0, C # 0, and (c) A = 0, C = 0. 
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Subcase (a): A # 0. In this case the characteristics of (14.84) are the families of 
straight lines defined by (14. 131). If Equation (14.84) is hyperbolic, then B 2 — AAC > 0 
and (14.131) defines two distinct families of real straight lines. If (14.84) is parabolic, 
then B 2 — AAC = 0, the two families of straight lines coincide, and (14. 131) defines only 
one family of real straight lines. If (14.84) is elliptic, then B 2 — AAC < 0, the lines 
defined by (14.131) are not real, and there are no real characteristics. 

Subcase (b): A = 0, C # 0. In this case the characteristics are defined by (14.132). 
Since A = 0, B 2 - AAC = B 2 > 0 and so the elliptic case cannot occur. In order for 
Equation (14.84) to be hyperbolic, we must have B # 0 and then (14.132) defines two 
distinct families of real straight lines. In the parabolic case, we must have B = 0 and 
then the two families coincide, so that (14.132) defines only one family of real straight 
lines. 

Subcase (c): A = 0, C = 0. In this case we must have B # 0 in order for Equa- 
tion (14.84) to be of the second order. But then B 2 — AAC = B 2 > 0, and so 
Equation (14.84) must be hyperbolic. The characteristics are given by (14.133) and 
hence are two distinct families of real straight lines. 

Let us summarize what we have thus found concerning the nature of the 
characteristics of Equation (14.84). If Equation (14.84) is hyperbolic, then the charac- 
teristics consist of two distinct families of real straight lines. If Equation (14.84) is 
parabolic, then the characteristics consist of a single family of real straight lines. If 
Equation (14.84) is elliptic, then there are no real characteristics. 

We observe the significance of these results as they relate to the initial-value problem 
associated with Equation (14.84). Recall that if the curve T 0 is nowhere tangent to a 
characteristic, then the initial-value problem has a unique solution. In the elliptic case 
there are no real characteristics and so this requirement concerning T 0 is automatically 
satisfied. Thus if Equation (14.84) is elliptic, the associated initial-value problem has a 
unique solution. In the hyperbolic and parabolic cases real characteristics exist; they 
are certain straight lines. Thus if Equation (14.84) is hyperbolic or parabolic, the 
associated initial-value problem has a unique solution if the curve T 0 is nowhere 
tangent to one of the straight lines which is a characteristic. 


► Example 14.10 


Find the characteristics of the equation 


d 2 u d 2 u 3 2 w 

dx 2 + dx dy dy 2 


(14.120) 


We first note that A = 1, B = 1, C = — 2, and B 2 — AAC = 9 > 0. Therefore 
equation (14.120) is hyperbolic and so the characteristics consist of two distinct 
families of real straight lines. Let us proceed to find them. We have observed that they 
satisfy the equation 

A dy 2 — B dx dy + C dx 2 = 0. 


For Equation (14.120), Equation (14.130) reduces to 

dy 2 — dx dy — 2 dx 2 = 0, 


(14.130) 



766 PARTIAL DIFFERENTIAL EQUATIONS 


and so the characteristics of Equation (14.120) are the solutions of 



From this we see that 


dy_ 

dx 


- dy 
2 or - 
dx 


- 1 . 


Thus the characteristics of Equation (14.120) are the straight lines of the two real 
families defined by 

y = 2x + c y and y=— x + c 2 > (14.134) 

where c i and c 2 are arbitrary constants (see Figure 14.3). 

In Example 14.9 we considered an initial-value problem associated with Equa- 
tion (14.120). We sought a solution </>(x, y) of Equation (14.120), defined for all (x, y), 
such that 

4>(t, t 2 ) = t* + it 2 , 

4> x (t, t 2 ) = At, (14.122) 

4> y (t, t 2 ) = 2 1 2 , 

for all real t. The curve T 0 defined for all real t by x = t, y = t 2 is tangent to a 
characteristic (14.134) of Equation (14.120) only at t = 1 and t = Thus on any 
interval a < t < b not including either of these two values, we are assured that this 
initial-value problem has a unique solution. Recall that in Example 14.9 we observed 
that a solution valid for all (x, y) is the function 0 defined by tj)(x, y) = lx 2 + y 2 . 



Figure 14.3 
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In Section 14.3 we proved that Equation (14.84) can always be transformed into an 
equation having a so-called canonical form. This reduction involved a transformation 

i = Z(x, y\ rj = rj(x , y) (14.85) 

of the independent variables in (14.84). In Theorems 14.3, 14.4, and 14.5 we listed 
specific transformations of this form which led to the desired reduction in the 
hyperbolic, parabolic, and elliptic cases, respectively. The resulting canonical forms 
were given by (14.94), (14.102), and (14.1 10) in these three respective cases. The specific 
transformations listed for the various cases are directly related to the characteristics of 
the Equation (14.84). We shall not attempt to determine the exact nature of this 
relation in all cases; rather we shall restrict our investigation to the case in which 
Equation (14.84) is hyperbolic and A # 0. 

Theorem 14.3 states that if Equation (14.84) is hyperbolic and A # 0, then a specific 
transformation which leads to the desired canonical form is given by 


£ = X t x + y, 
t] = X 2 x + y, 

where X { and X 2 are the roots of the quadratic equation 

AX 2 + BX + C = 0. 
More explicitly, this transformation may be written 



— B — yjB 2 — 4AC~ 

x + y. 

2A 


~ — B + y/B 2 — 4AC~ 

x + y. 

rj = 

2 A 


(14.95) 

(14.96) 


(14.135) 


Turning to the characteristics of Equation (14.84) in the case under consideration, we 
note that they are the two distinct families of straight lines defined by 


_ \B + JB 2 - 4AC~ 




2A 


and 


_ \B- JB 2 - 4 AC 




2A 


X + c x 


X + c 2 , 


(14.131) 


where c t and c 2 are arbitrary constants. Comparison of (14.135) and (14.131) clearly 
indicates the relation between the canonical form-producing transformation and the 
characteristics in this case. For here the transformation is given by £ = £(x, y), 
rj = rj(x , y), where £(x, y) = c u rj(x , y) = c 2 are the two families of characteristics of 
Equation (14.84). 


► Example 14.11 


Let us again consider the equation 


d 2 u d 2 u d 2 u _ 

dx 2 + dx dy dy 2 


(14.120) 


Since Equation (14.120) is hyperbolic and A # 0, a canonical form-producing 
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transformation is given by ^ = £(x, y ), rj = rj(x , y), where £(x, y) = c u rj(x , y) = c 2 are 
the two families of characteristics of Equation (14.120). In Example 14.10 we found 
that the characteristics of Equation (14.120) are the straight lines defined by (14.134). 
Therefore the two families of characteristics are given by 

y — 2x = c t and y + x = c 2 , 

and a canonical form-producing transformation of Equation (14.120) is given by 

f = y - 2x, 


Applying (14.136) to Equation (14.120), we see that this equation transforms into 

d 2 u d 2 u 

— 7 = Q and hence into — — = 0, (14.137) 

drj d£ dri 

which is the canonical form in the hyperbolic case. We observe that the particular 
characteristics y = 2x and y = -x transform into the coordinate axes { = 0 and 
rj = 0, respectively, in the plane. 

We note that we may obtain a solution of Equation (14.137) involving two arbitrary 
functions merely by integrating first with respect to £ and then with respect to g. We 
thus obtain the solution 


u = f(t) + g(fi ), 

where / and g are arbitrary functions of their respective arguments. Using (14.136) to 
return to the original independent variables x and y, we obtain the solution 

u = f(y- 2x) + g(y + x) 

of Equation (14.120) involving the two arbitrary functions / and g. We point out 
that this solution could be obtained directly by the procedure outlined in Section 14.1. 

Exercises 


1. Consider the partial differential equation 


d 2 u 
dx dy 


= 0 


(A) 


and the five real functions defined for all real t by 

x(t) = t, y(t) = 3 — t, u(t) = t , 
p(t) = t, q(t) = t — 1. 

(a) Show that these five functions satisfy the strip condition (14.119). Then 
consider the initial-value problem in which one seeks a solution 0(x, y) of 
partial differential equation (A), defined for all (x, y) such that 


0(t, 3 — t) = t, 


4> x (t, 3 -t) = t, 

<t>y(t , 3 - t) = t ~ 1, 

for all real t. Interpret this problem geometrically. 
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(b) Show that the function 4> defined for all (x, y) by 

x 2 y 2 3 

</>(*, y) = y- y + 2y-- 

is a solution of the initial-value problem considered in part (a). 
2. Consider the partial differential equation 

d 2 u d 2 u „ 


and the five real functions defined for all real t by 

x(t) = t, y(t) = 0, u(t) = t 4 + t 2 , 
p(t) = A 1 3 -f 2 1, q(t) = At. 

(a) Show that these five functions satisfy the strip condition (14.119). Then 
consider the initial-value problem in which one seeks a solution 0(x, y) of 
partial differential equation (B), defined for all (x, y), such that 

0(t, 0) = t 4 + t 2 , 

0 x (f, 0) = 4t 3 -f 2t, 

4>y{u 0) = At 

for all real t. Interpret this problem geometrically. 

(b) Show that the function 0 defined for all (x, y) by 

0(x, y) = x 4 + 6x 2 y 2 + y 4 + x 2 + y 2 + 4xy 

is a solution of the initial-value problem considered in part (a). 

For each of the partial differential equations in Exercises 3 through 6, determine 
whether or not real characteristics exist. For those equations for which families of real 
characteristics exist, find these families and sketch several members of each family. 


d 2 u d 2 u d 2 u 
dx 2 dx dy dy 2 

d 2 u 8 2 u d 2 u 
2 d^~ 2 d^ + 5 dP = 0 ' 


d 2 u . d 2 u . d 2 u 

d^ + 4 d7d~y + 4 W = 

d 2 u d 2 u d 2 u 

2 d^ + 7 d^y + 6 W = 
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APPENDIX ONE 

Second - and Third-Order Determinants 


Determinants played an important role on a number of occasions in several chapters. 
Although certain definitions and theorems involved nth-order determinants, the 
relevant illustrative examples and exercises were concerned almost entirely with 
second- and third-order determinants. In this brief appendix, we discuss the evaluation 
of these two simple cases. 

The second-order determinant denoted by 

flu fl 12 
a 2\ a 22 

is defined to be 


^ 11^22 ~ a 12 a 21 * 


Thus, for example, the second-order determinant 


is given by 


6 5 
-3 2 


( 6 )( 2 ) — ( 5 )( — 3 ) = 27 . 


The third-order determinant denoted by 


flu 

a l2 

a l3 

a 2 1 

a 22 

a 23 

a 3l 

a 32 

a 33 
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may be defined as 


„ a 22 a 23 a 2l a 23 a 21 a 22 

a ll „ „ ~ a l2 „ „ + a l3 „ „ , (2) 

a 32 a 33 a 3l a 33 a 3l a 32 

where each of the three second-order determinants involved is evaluated as explained 
in the preceding paragraph. Thus, for example, the third-order determinant 

1 2 -3 

3 4-2 (3) 

-15 6 


is given by 


3-2 ,^34 

-1 6 + — 1 5 


= (1)E(4)(6) - ( — 2)(5)] - (2)[(3)(6) - (-2)(-l)] + (-3)[(3)(5) - (4)(- 1)] 

= (24 + 10) - 2(18 - 2) - 3(15 + 4) = -55. 

The expression (2) for the third-order determinant (1) is sometimes called its ex- 
pansion by cofactors along its first row. Similar expansions exist along each remaining 
row and along each column, and each of these also gives the value defined by (1). We list 
these other five expansions for evaluating (1): 


a l2 

<*13 

, 011 

013 

011 

fl 12 



+ 022 


~ 023 


a 32 

a 33 

031 

033 

031 

fl 32 

a l2 

013 

011 

013 

. fl ll 

012 



— 032 


+ 033 


a 22 

023 

021 

023 

021 

022 

a 22 

023 

012 

013 

. 012 

013 



— 021 


+ 031 


a 32 

033 

032 

033 

022 

023 

a 2 1 

023 

. 011 

013 

a u 

013 



+ 022 


— 032 


a 3l 

033 

031 

033 

021 

023 

a 2 1 

022 

a n 

012 

, n a H 

012 



— 023 


+ 033 


a 3l 

032 

031 

032 

021 

022 


For example, we evaluate the determinant (3) using the expansion (7), its so-called 
expansion along its second column. We have 

3-2 1-3 1-3 

- (2) -l 6 +(4) -l 6- (5) 3 -2 

= — (2)[(3)(6) - ( — 2)(— 1)] + (4)[(1)(6) - ( — 3)( — 1)] - (5)[(l)(-2) - (-3)(3)] 

= -2(18 - 2) + 4(6 - 3) - 5( — 2 + 9) = -55. 


Of course, one does not need all six of the expansions (2), (4), (5), (6), (7), and (8) to 
evaluate the determinant (1). Any one will do. In particular, the expansion (2) which we 
gave initially is quite sufficient for the evaluation. However, we shall see that under 
certain conditions it may happen that a particular one of these six expansions involves 
considerably less calculation than the other five. 
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We now state and illustrate certain results that facilitate the evaluation of 
determinants. We first have: 

Result A. If each element in a row (or column) of a determinant is zero, then the 
determinant itself is zero. 

Result B. If each element in a row (or column) of a determinant is multiplied by the 
same quantity c, then the value of the determinant itself is also multiplied by c. 

Thus, for the third-order determinant (1), if each element in its second row is 
multiplied by c, we have 


<*1 1 

<*12 

<*13 

<*11 

<*12 

<*13 

ca 2 1 

ca 21 

ca 23 = c 

<*21 

<*22 

<*23 

<*31 

a 32 

<*33 

<*31 

<*32 

<*33 


For example, we have 


1 

2 

— 3 | 

1 

2 -3 

1 

2 

-3 

12 

16 

-8 = 4(3) 

4(4) 4( — 2) =4 

3 

4 

-2 

-1 

5 

6 1 

1 -1 

5 6 

-1 

5 

6 


Result C. If two rows (or columns) of a determinant are identical or proportional, 
then the determinant is zero. 

For example, 

1 2 4 

-12 5 = 0 , 

3 6 12 

because the first and third rows of this determinant are proportional. 

Result D. (1) Suppose each element of the first row of a third-order determinant is 
expressed as a sum of two terms, thus 

<*n = bn + c u , a l2 = b 12 + c 12 , ^13 = b 13 + c 13 . 

Then the given determinant can be expressed as the sum of two determinants, w'here the 
first is obtained from the given determinant by replacing a Y 1 by b Y l9 a l2 by b i2 , and a l3 
by b 13 , and the second is obtained from the given determinant by replacing a n by c n , 
a l2 by c l2 , and a l3 by c 13 . That is, 


b u +c n 

b 12 + C 12 

b 13 + C 13 


b 11 

b l2 

b 13 


c n 

C 12 

C 13 

<*21 

<*22 

<*23 

= 

<*21 

<*22 

<*23 

+ 

<*21 

<*22 

<*23 

<*31 

<*32 

<*33 


<*31 

<*32 

<*33 


<*31 

<*32 

<*33 


(2) Analogous results hold for each of the other rows and for each column. 
For example, we see that 


3 + 5 

6 + 2 

7+ 1 


3 

6 

7 

5 

2 

1 

4 

8 

-2 

= 

4 

8 

-2 + 

4 

8 

-2 

9 

-4 

-7 


9 

-4 

-7 

9 

-4 

-7 
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and similarly that 


-2 1 + 8 

3 5-3 

7 

-1 

= 

-2 1 

3 5 

7 

-1 

+ 

-2 8 7 

3-3-1 

4 6 + 9 

5 


4 6 

5 


4 9 5 


Result E. The value of a determinant is unchanged if each element of a row (or 
column) is multiplied by the same quantity c and then added to the corresponding 
element of another row (or column). 


Thus, for the third-order determinant (1), if each element of the second row is 
multiplied by c and then added to the corresponding element of the third row, we have 


a il 

a l2 

a l3 


a ll 

a l2 

a l3 

a 2l 

a 22 

a 23 

= 

a 2l 

a 22 

a 23 

«31 

a 32 

a 33 


a 3l + Ca 2l 

a 32 T ca 22 

a 33 + Ca 23 


For example, we have 


5 7-1 


5 7 -1 


5 7-1 

1 4 -2 


1 4 -2 

= 

1 4 -2 

6-5 8 


6 + (3)(1) — 5 + (3)(4) 8 + (3)( — 2) 


9 7 2 


Result E is very useful in evaluating third-order determinants. It is applied twice to 
introduce two zeros, one after the other, in a certain row (or column). Then when one 
employs the expansion of the determinant along that particular row (or column), two 
of the second-order determinants in the expansion are multiplied by zero and hence 
drop out. Thus the given third-order determinant is reduced to a multiple of a single 
second-order determinant. We illustrate by evaluating determinant (3) in this manner, 
introducing two zeros in the first column and then using expansion (6) along this 
column: 


1 2 

-3 


1 

2 

-3 

3 4 

-2 

= 

3 + (—3)(1) 

4 + ( — 3)(2) 

— 2 + (— 3)(— 3) 

-1 5 

6 


-1 +(1)(1) 

5 + (1)(2) 

6 + (!)( — 3) 


1 2 -3 
0-2 7 

0 7 3 



-2 7 
7 3 



= (-2)(3) - (7)(7) 55. 


-3 

7 
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X 

In x 

X 

In x 

X 

In x 

X 

In x 





6.0 

1.792 

9.0 

2.197 

0.1 




6.1 

1.808 

9.1 

2.208 

0.2 

■ 

3.2 

1.163 

6.2 

1.825 

9.2 

2.219 

0.3 


3.3 

1.194 

6.3 

1.841 

9.3 

2.230 

0.4 

- 0.916 

3.4 

1.224 

6.4 

1.856 

9.4 

2.241 

0.5 

- 0.693 

3.5 

1.253 

6.5 

1.872 

9.5 

2.251 

0.6 

- 0.511 

3.6 

1.281 

6.6 

1.887 

9.6 

2.262 

0.7 

- 0.357 

3.7 

1.308 

6.7 

1.902 

9.7 

2.272 

0.8 

- 0.223 

3.8 

1.335 

6.8 

1.917 

9.8 

2.282 

0.9 


3.9 

1.361 

6.9 

1.932 

9.9 

2.293 

1.0 



1.386 

7.0 

1.946 

10 

2.303 

1.1 


4.1 

1.411 

7.1 

1.960 

20 

2.996 

1.2 

0.182 

4.2 

1.435 

7.2 

1.974 

30 

3.401 

1.3 


4.3 

1.459 

7.3 

1.988 

40 

3.689 

1.4 

0.336 

4.4 

1.482 

7.4 

2.001 

50 

3.912 

1.5 

■ 

4.5 

1.504 

7.5 

2.015 

60 

4.094 

1.6 


4.6 

1.526 

7.6 

2.028 

70 

4.248 

1.7 

0.531 

4.7 

1.548 

7.7 

2.041 

80 

4.382 

1.8 

0.588 

4.8 

1.569 

7.8 

2.054 

90 

4.500 

1.9 

0.642 

4.9 

1.589 

7.9 

2.067 

100 

4.605 

2.0 

0.693 

5.0 

1.609 

8.0 

2.079 



2.1 

0.742 

5.1 

1.629 

8.1 

2.092 



2.2 

0.788 

5.2 

1.649 

8.2 

2.105 



2.3 


5.3 

1.668 

8.3 

2.116 



2.4 

0.875 

5.4 

1.686 

8.4 

2.128 



2.5 

0.916 

5.5 

1.705 

8.5 

2.140 



2.6 

0.956 

5.6 

1.723 

8.6 

2.152 



2.7 

0.993 

5.7 


8.7 

2.163 



2.8 


5.8 


8.8 

2.175 



2.9 

■9 

5.9 

HQIIJI 

8.9 

2.186 
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X 

e x 

e x 

X 

e x 

e x 

0.0 

1.000 

1.0000 

3.0 

20.086 

0.0498 

0.1 

1.105 

0.9048 

3.1 

22.198 

0.0450 

0.2 

1.221 

0.8187 

3.2 

24.533 

0.0408 

0.3 

1.350 

0.7408 

3.3 

27.113 

0.0369 

0.4 

1.492 

0.6703 

3.4 

29.964 

0.0334 

0.5 

1.649 

0.6065 

3.5 

33.115 

0.0302 

0.6 

1.822 

0.5488 

3.6 

36.598 

0.0273 

0.7 

2.014 

0.4966 

3.7 

40.447 

0.0247 

0.8 

2.226 

0.4493 

3.8 

44.701 

0.0224 

0.9 

2.460 

0.4066 

3.9 

49.402 

0.0202 

1.0 

2.718 

0.3679 

4.0 

54.598 

0.0183 

1.1 

3.004 

0.3329 

4.1 

60.340 

0.0166 

1.2 

3.320 

0.3012 

4.2 

66:686 

0.0150 

1.3 

3.669 

0.2725 

4.3 

73.700 

0.0136 

1.4 

4.055 

0.2466 

4.4 

81.451 

0.0123 

1.5 

4.482 

0.2231 

4.5 

90.017 

0.0111 

1.6 

4.953 

0.2019 

4.6 

99.484 

0.0101 

1.7 

5.474 

0.1827 

4.7 

109.947 

0.0091 

1.8 

6.050 

0.1653 

4.8 

121.510 

0.0082 

1.9 

6.686 

0.1496 

4.9 

134.290 

0.0074 

2.0 

7.389 

0.1353 

5 

148:4 

0.00674 

2.1 

8.166 

0.1225 

6 

403.4 

0.00248 

2.2 

9.025 

0.1108 

7 

1096.6 

0.00091 

2.3 

9.974 

0.1003 

8 

2981.0 

0.00034 

2.4 

11.023 

0.0907 

9 

8103.1 

0.00012 

2.5 

12.182 

0.0821 

10 

22026.5 

0.00005 

2.6 

13.464 

0.0743 




2.7 

14.880 

0.0672 




2.8 

16.445 

0.0608 




2.9 

18.174 

0.0550 







ANSWERS TO 

ODD-NUMBERED EXERCISES 


Section 1 .1 

1. Ordinary; first; linear. 

3. Partial; second; linear. 

5. Ordinary; fourth; nonlinear. 
7. Ordinary; second; linear. 

9. Ordinary; sixth; nonlinear. 


Section 1 .2 

5. (a) 2,3, -2. (b) -1, -2,4. 


Section 1 .3 

1. No; one of the supplementary conditions is not satisfied. 
3. (a) y = 3e* x + 2e~ 3x . (b) y= -2e~ 3x . 

5. y = 2x — 3x 2 + x 3 . 

7. Yes. 


Section 2.1 

1. 3x 2 + 4 xy + y 2 = c. 

3. x 4" x 2 y + 2 y 2 = c. 

5. 3x 2 y + 2y 2 x - 5x - 6 y = c. 

7. y tan x + sec x + y 2 = c. 

9. s 2 — s = ct. 

11. x 2 y - 3x + 2y 2 = 7. 

13. y 2 cos x — y sin 2 x = 9. 

15. — 3y + 2x + y 2 = 2 xy. 

17. (a) A = |; 2x 3 + 9x 2 y + 12y 2 = c. 
(b) A = -2; 2x 2 — 2y 2 — x = cxy 2 . 


777 
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19. (a) x 2 y + c. (b) 2x l y 3 - \x 2 y 4 + c. 
21. (b) x 2 . (c) x 4 + x 3 y 2 = c. 


Section 2.2 

1. (x 2 + 1 ) 2 y = c. 

3. r 2 + s = c( 1 - r 2 s). 

5. r sin 2 6 = c. 

7. (x + 1 ) 6 (y 2 + 1) = c(x + 2) 4 . 

9. y 2 + xy = cx 3 . 

y 

11. sin — = cx. 
x 

13. (x 2 + y 2 ) 312 = x 3 In cx 3 . 

15. x + 4 = (y + 2) 2 e~ iy+l) . 

17. 16(x + 3)(x + 2) 2 = 9(>; 2 + 4) 2 . 

19. (2x + y) 2 = 12 (y — x). 

23. (a) x 3 — y 3 + 6 xy 2 = c. 

(b) 2x 3 + 3 x 2 y -f 3xj; 2 = c. 


Section 2.3 

1. y = x 3 + cx“ 3 . 

3. y = (x 3 + c)e~ 3x . 

5. x = 1 + ce l,t . 

7. 3(x 2 + x)y = x 3 — 3x + c. 

9. y = x _1 (l + ce~ x ). 

11. r = (6 -f c)cos 6. 

13. 2(1 + sin x)y = x + sin x cos x + c. 
15. y = (1 + cx -1 ) -1 . 

17. y = (2 + cc -8 * 2 ) 1/4 . 

19. y = x 4 — 2x 2 . 

21. y = (e x + l) 2 . 

23. 2 r = sin 26 + sj2 cos 6. 

25. x 2 / = x 4 +15. 


f 2(1 - e~ x ), 

0 < x < 1, 

y \2(e - l)e~ x , 

x > 1. 

\ e~ x (x + 1), 

0 < x < 2, 

y [2e- x + e- 2 , 

x > 2. 


icc ^ x 

31. (a) y = -r + ce~ bxla if ), # b/a; 

b — a/. 


y = 


kxe- bxla 


a 


+ ce~ bxla if 1 = b/a. 


35. (b) y = sin x — cos x + sin 2x — 2 cos 2x + ce~ x . 
37. (a) 2x sin y — x 2 = c. 

(b) y 2 + 2y + ce~ x2 — 1 = 0. 

39. y = (x — 2 + ce~ x )~ l + 1. 

41. y = (2 + ce~ 2x2 )~ l + x. 
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Section 2.3. Miscellaneous Exercises 

1. (x 3 -h l) 2 = |c.y|. 

3. xy + 1 = c(x + 1). 

5. (3x — y) 2 = \c(y — x)|. 

7. y = f + c(x 4 + l) -2 . 

9. x*y 2 — x 3 y = c. 

11. (2x + 3)>) 2 = \c(y — x)|. 

13. y = 1 + c(x 3 + l) -2 . 

15. y = (x 2 + 3x) 1/2 . 

17. y = e~*(2x 2 + 4) 1/2 . 

19. (j> 2 + l) 2 = 2x. 

21. y = (x 2 + l) 1/2 /2. 

23. (x + 2)}? = x 2 + 8, 0 < x < 2, 
(x + 2)y = 4x + 4, x > 2. 


Section 2.4 

1. 4x 5 ^ + 4x 4 ’y 2 + x 4 = c. 

3. x_y 2 e JC + ye x = c. 

5. x 3 y 4 (xy + 2) = c. 

7. 5x 2 + 4xy + y 2 + 2x + 2y = c. 


9. 


ln[c(x 2 + y 2 — 2x + 2y + 2)] + 4 arctan 



= 0 . 


1 1. 12x 2 + 16x_y + 4^ 2 + 4x + 8y - 89 = 0. 

13. x + 2y - ln|2x + 3)> — 1 1 - 2 = 0. 

21. (a) y = cx + c 2 . (b) y = — x 2 /4. 


Section 3.1 

1. x 2 + 3 y 2 = k 2 . 

3. x 2 + y 2 — In y 2 = k. 

5. x = y — 1 + ke~ y . 

7. x 2 y + y 3 = k. 

9. y 2 = 2x — 1 + ke~ 2x . 

11. x 2 + y 2 = ky. 

13. n = 3. 

15. ln(x 2 + y 2 ) + 2 arctan(j/x) = c. 

17. ln|3x 2 + 3 xy + 4y 2 | - (2/ N /39)arctan[(3x + S^/^/^x] = c. 


Section 3.2 

1. (a) v = 8(1 — e~ 4 '), x = 2(4 1 - 1 + e~ 4 '). 

(b) 8 ft/sec; 38 feet. 

3. Rises 10.92 feet. 

5. (a) v = 245(1 - <T‘ /25 ), x = 245[t + 25(e~' /25 - 1)]. 
(b) t = 5.58 sec. 

7. (a) 10.36 ft/sec. (b) 13.19 ft/sec. 

_ 256 [91 + 59e~ ,/4 ] 

' V ~ 91 - 59e~ ,/4 ' 
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11 . 


V = 


75(1 -<T 8 ^) 
1 + e _8V5 ' 


13. 16.18 feet. 

15. (a) 10.96 ft/sec. (b) 20.46 ft/sec. 
17. 4.03 ft/sec. 

19. v 2 = — (k/m)x 2 + i?o + (k/m)x q. 


Section 3.3 

1. (a) 59.05%. (b) 2631 years. 

3. (a) 50^/2 grams. (b) 12.5 grams, (c) 10.29 hours. 
5. 63.5 years. 

7. (a) 9 times original number; (b) 10.48 hours. 

9 (a) x - ( 10 > 6 • 

y. W A — j 4 e 59.4-3t/100> 


(b) 312,966; (c) 1,000,000. 

_ (10) 5 [3e 15(l98O_,)/(1O,4 - 2] 

X _ 6^1 5(1980 -f)/(10) 4 _ j 


13. (a) $1822.10. (b) approx. 11.52 years. 
15. (a) 112.311b. (b) 17.33 minutes 
17. (a) 318.53 1b. (b) 2.74 minutes. 

19. 11,179.96 grams. 

21. (a) 0.072%. (b) 1.39 minutes. 

23. (a) 63.33 °F. (b) 13.55 minutes. 

25. 7.14 grams. 


Section 4.1 B 

7. (c) y = 3e 2x — e 3x . 
9. (c) y = 4x — x 2 . 


Section 4.1 D 

1. y = c x x + c 2 x 4 . 

3. y = c t x + c 2 (x 2 + 1). 

5. y = c t e 2x + c 2 (x + 1). 

9. y p = -f - 2x + 3e x . 

Section 4.2 

1. y = c t e 2x + c 2 e 3x . 

3. y = c Y e xl2 + c 2 e 5x/2 . 

5. y = c t e x + c 2 e~ x + c 3 e 3x . 

7. y = (c t + c 2 x)e* x . 

9. y = ^ 2x (c x sin 3x + c 2 cos 3x). 
11. y = c x sin 3x + c 2 cos 3x. 

13. y = (c t + c 2 x)e x + c 3 e 3x . 

15. y = (c t + c 2 x + c 3 x 2 )e 2x . 
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17. y = c t e x + c 2 sin x + c 3 cos x. 

19. y = c i + c 2 x + c 3 x 2 + (c 4 + c 5 x)e x . 

21. y = c l e 2x + c 2 e 3x + e~ x (c 3 sin x + c 4 cos x). 

23. y = e^ x,2 ^c x sin + ° 2 cos 

+ ^ _>/2 x/ 2 ^C3 sin ^~y~ + ^4 cos 

25. )> = 2e 4x + e~ 3x . 

27. )> = —e 2x + 2e* x . 

29. )> = (3x + 2)e~ 3x . 

31. y = ( 13x + 3)e~ 2x . 

33. y = e 2x sin 5x. 

35. y = e~ 3x (4 sin 2x + 3 cos 2x). 

37. y = 3e“ (1/3) *[sin |x + 2 cos §x]. 

39. y = e x — 2e 2x + e 3x . 

41. y = e~ x — ^ e 2x + f xe 2 *. 

45. y = sin x + c 2 cos x + e~ x (c 3 sin 2x + c 4 cos 2x). 


Section 4.3 

1. y = c t e x + c 2 e 2x + 2x 2 + 6x + 7. 

3. y = e~ x (c t sin 2x + c 2 cos 2x) + 2 sin 2x — cos 2x. 


5. 


y = e x (c! sin^/^x + c 2 cos^/^x) + 


sin 4x 
26 


3 cos 4x 
52 


7. y = Cl e x + c 2 e 5x + |( e x + e 5 *). 

9. y = c t e x + c 2 e~ 2x -f c 3 e~ 3x + 3x 2 + x + 4. 

v- . - - - 9 sin 2x 2 cos 2x _ 2 

11. 3; = c x c + e (c 2 sin 2x + c 3 cos 2x) — 1 — * 2x 


_9x _ 82 
“T"25* 

13. y = c t e 2x + c 2 e~ 3x + 2xe 2 * — 3e 3x + x + 2. 

15. y = c t e~ x + (c 2 + c 3 x)e 2x + 2e x — 2xe~ x . 

2x 3 

17. y = c t + c 2 sin x + c 3 cos x + — 4x — 2x sin x. 

X 2 £ x 3 x £* 

19. y = c t e * + c 2 e 2x + c 3 e 3x + — H — — — I- 4xe 2x + e* x . 

21. y = Ci sin x + c 2 cos x — *x 2 cos x + |x sin x. 

x 4 4x 3 19 e 3x 

23. = c t + c 2 x + c 3 e x + c A e 3x — — x 2 + 2x 2 e x — 9xe x + — . 

2 3 6 27 

25. y= -6e x + 2e 3x + 3x 2 + Sx + 10. 

27. = 3e 3 * - 2e 5x + 3xe 2x + 4e 2 *. 

29. y = 4xe~ 4x + 2e~ 2x . 
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31. 31 = 
33. 31 = 
35. y = 
37. 31 = 

39. y = 

41- }>„ = 
43. y p -- 
45. = 


47. y p 
49. y p 

51. y P 

53. y p 
or 


Section 4.4 

1. 31 = 
3. y = 

5- 3' = 

1. y = 

9. y = 
11. y = 
13. 31 
15. y 


17. 31 
19. y 

21. 3' 

23. 31 
25. y 


Section 4.5 

1. y = 
3. y = 


e _2x (f sin 3x — 2 cos 3x) 4- 2e~ 2x . 
j[^ 2x (sin 3x 4- 2 cos 3x) 4- sin 3x 4- 3 cos 3x]. 

(x 4- 5)e x 4- 3x 2 e* + 2xe 2x — 4e 2x . 

6 cos x — sin x + 3x 2 — 6 4- 2x cos x. 

le~ x 3\e 2x ( 3xe x 5e x sin x 
~20 40 * 4~ + ~4 KP 

= Ax 3 4- Bx 2 4- Cx 4- D + Ee~ lx . 

- Ae~ lx 4- Bxe~ 2x sin x 4- Cxe~ 2x cos x. 

= Ax; 2 e _3x sin 2x 4- Bx 2 e~ 3x cos 2x + Cxe~ 3x sin 2x 
4 - Dxe~ 3x cos 2x 4- Ex 2 e~ 2x sin 3x 4- Fx 2 e~ lx cos 3x 
-f Gxe~ 2x sin 3x 4- Hxe~ lx cos 3x + le~ 2x sin 3x 4- Je _2x cos 3x. 

= Ax*e 2x 4- Bx 3 e 2x + Cx 2 e 3x 4- Dxe 3 * 4- Ee 3x . 

- Ax 3 sin 2x 4- Bx 3 cos 2x 4- Cx 2 sin 2x 4- Dx 2 cos 2x 4- Ex sin 2x 
Fx cos 2x + Gx 5 e 2x 4- Hx*e 2x 4- Ix 3 e 2x -h Jx 2 e 2x 4- Kxe 2x . 

= Ax 4 sin x + Bx 4 cos x + Cx 3 sin x + Dx 3 cos x + Ex 2 sin x 
4 - Fx 2 cos x. 

= A 4- Bx sin 2x 4- Cx cos 2x 4- Dxe x 4- Exe~ x 

> p = Ax sin 2 x 4 - Bx cos 2 x 4- Cx sin x cos x 4- Dx sinh x + £x cosh x. 


c t sin x 4- c 2 cos x 4- (sin x)[ln |csc x - cot x|]. 
c Y sin x 4- c 2 cos x 4- (cos x)[ln |cos x|] 4- x sin x. 


sin 2x 


• _ Jill z-a, _ . 

sin 2x + c 2 cos 2x H — [In |sec 2x + tan 2x|] 


1 

4' 


e 2 *(c 1 sin x + c 2 cos x) + xe 2x sin x + (In |cos x\)e 2x cos x. 
1 


Ci + c 2 x + 


2x 


,-3x 


c t sin x + c 2 cos x + [sin x][ln |csc x - cot x|] 

— [cos x][ln | sec x + tan x|]. 

c t e~ x + c 2 e~ 2x + ( e~ x + e~ 2 *)[ln (1 + e*)]. 

COS 2 X 

c, sin x 4 V c 2 cos x 4- (sin x)[ln (1 4- sin x)] — x cos x — : — 

1 1 1 4- sin x 


Cye x 4- c 2 e 


-2x 


-he x In | x | - e 2x 



CyX 2 4- C 2 x 5 - 3x 3 - fx 4 . 

c t (x 4- 1) 4- c 2 x 2 - x 2 - 2x 4- x 2 In |x|. 

c t e x -h c 2 x 2 -h x 3 e x — 3x 2 e x . 


c t sin x 4- c 2 x sin x 4- — sin x. 


CyX -h c 2 x 3 . 

CyX l/2 -h C 2 X 312 . 
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17. y = c t sin(ln x 2 ) + c 2 cos(ln x 2 ) + 


5. 

y = 

7. 

y = 

9. 

y = 

11. 

y = 

13. 

y = 

15. 

y = 

17. 

y = 

19. 

y = 

21. 

y = 

23. 

y = 

25. 

y = 

27. 

y = 

29. 

y — 


, 2 \ 


•x 1 / 3 . 
n x)x 1/3 . 
c 2 + C 3 X 3 . 

n v\ v 3 1 /, 


x In x 2 


19. y = (c i + c 2 In x)x + c 3 x 2 + : 


1 (x X' 

+ l 


- ,nx + s)' 


29. y = cflx — 3) + c 2 (2x — 3) 3 . 


4x 

25' 


Section 4.6 

1. (a) /,(x) = 2e x - e 2 *, f 2 (x) = -e x + e 2 *. (b) 5/,(x) + 7/ 2 (x). 


Section 5.2 

cos 16f 

1. x = — - — ; i (ft), 7t/8 (sec), S/n oscillations/sec. 


3. (a) x = — — — . (b) i (ft), n/4 (sec), 4/n oscillations/sec. 

(c) t = ^ + ^-(n = 0,1,2,...). (d) t = ^ + ™(n = 0,1,2,...). 

„ , . sin lOt cos lOt >/34. r , ,, „ , .... 

5. (a) x = 1 — . (b) -*yj- (ft); rt/5 (sec); 5/n oscillations/sec. 

(c) 0.103 (sec); —3.888 ft/sec. 

7. 18 lb. 


Section 5.3 

1 d 2 x dx 

1- (a) - + 2 — + 20x = 0, x(0) = j, x'(0) = 0. 

, - 4t f sin 8t cos 8t\ 

(b)x=e "(—+—} 

(c) x = e~ M cos(8( — </>), where 4> * 0.46. 


(d) n/4 (sec). 
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3. x = (6 t+i)e~ 


5. (a) x = e 


„./sinl6 t 2cosl6t\ J5 _ 8 . ... ... 

8( l — 1 l;x = -^-e 8 cos(16t — </>), where 


< p x 0.46. 

(b) 7t/8 (sec); n. (c) 0.127. 
7. (a) x = (j + 2 t)e~ 8 ‘. 


= e 4< f sin 4^/3 1 + ^ cos 4,/3 A. 


(C) X = f + 24 V ^ ) e<- ‘ 6 + 8v/?>< + ( 3 2 ^ -^ ( ~ 16 ~ 8v/?>< - 

9. (a) 64. (b) 8^3. 


Section 5.4 

1. x = 


cos 12 1 — cos 20 1 


3. x = — 2te 8t + - 


5. x = e 8t ( — sin A x /lt + cos 4 x /2 1 ) + sin At - cos At. 

V 2 V V ) 

7. x = e~ 2t ^ — ^ — — 2 cos 4 tj + sin 2f + 2 cos 2t,0 <t <n; 

., , ,/3 sin 4t . \ 

x = (e 2 * - l)e 2 '( — 1- 2 cos 4t\,t > n. 


9. (a) x = cos It — cos St. 


Section 5.5 

, ,,2^2 sin 4j$t -cos 4^31) 

1. (a)— ;x- jg 


72 sin A^Jlt + cos 4 x /2f 


t sin 8r 


_ 1/ J1U 0 1/ 

(b) 8; x = - . 

3. (b) 2/n; 3 V / 5/4. (c) 722/2*; 15^/23/23. 


Section 5.6 

1. i = 4(1 — e~ 50 '). 

I _ p-soot 

3. q = - ;i- 10<r ! °°'. 

5. i = e _80, ( — 4.588 sin 60t + 1.247 cos 60t) - 1.247 cos 200t + 1.331 sin 200t. 
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7. q-- 

i = 

Section 6.1 

1. )> = 

3. y = 
5. y = 

7. y = 
9. y = 
11. y = 
13. y = 
15. y = 

17. y = 

Section 6.2 

1. x = 
3. x = 

5. y = 
7. y = 
9- y — 



r a (-i)"x 2 " i r « (-1)"* 

C °l + „?, [2-4-6---(2n)]J + Cl [ X + „? 1 [3-5-7---| 


2n+ 1 

(2 n + 1)] 


x 2 x 4 


Co,1 -y-24 + - +Ci x -y-30 + 


,3 v 4 Ux 5 


C 0 1 -X 2 + — + 


40 


+ ... 


( X 3 X 4 X 5 \ 

+c X~y~t + t + '"} 

( x 3 3x 5 \ / X 3 x 4 

H T + W + ') + c X~~6~n + 


3x 5 

40 


+ 


x 3 x 6 


Coll+ T + 18 + '"J + Cl l X + T + 252 


x’ 17x 7 


1+ Z 


,2n 


,2n 


= X — • 

4 - 0 . 


, [2 • 4 • 6 • • • (2n)] „ = o 2"n! 

. , 7x 3 x 4 21x 5 

2 + 3)t -_- T + _ + ", 

(x - l) 2 (X - l) 3 5(x - l) 4 (x - l) 5 

c n | 1 - 2 — - — — + - — — + 2 — t + 


c 0 [l- ! 

+ cj(x- 1)- 


2 

\2 


12 

\3 


+• ••• 


2 6 

2 + 4(x - 1) - 4(x - l) 2 + 


(x - l) 2 + (X - l) 3 _ (X - l) 5 + 


12 




2 , 4(x - l) 3 (x - l) 4 _ 2(x - l) 5 


3 


15 


+ 


0 and x = 3 are regular singular points. 

1 is a regular singular point; x = 0 is an irregular singular point. 


ClX( 1 "l 4 + 616 




,2 x 4 

2 ' 40 


_ 4/3/1 3x 2 9x 4 \ ^ 2/3/1 3x 2 9x 4 \ 

' ( 1 --i^ + 896-") + c > x T-r + mo-"} 

_ 1/^/1 3x 2 9x 4 \ 

c ' x '{'-l6 + m- -) + c 


1/3 , 3x 2 9x 4 

2 x 1/3 | 1 - — + ^r- 


8 320 
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11. y 
13. y 
15. y 

17. y 

19. y 

21. y = 

23. y = 

25. y = 

Section 6.3 

3. 31 = 

Section 7.1 

1. x = 

3. x = 


3x 2 

c ^ l+x + lo + ' 

„ 1/0/ . X 2 X 4 

^'I'-T + Tio 


7x 5x 2 


+ C 2 x 1/3 ( 1 + 4^ + 


12 36 


+ 


_ 1/0/ . x x* 

+ c ^ 'l'-y + 24 - 


(2m + 3)] 


[ 00 ~~ I I f” 00 -y* ^ 

1 + .?,2 v] + C! T + .?,Ew^ 

444-)M'44H- 


] 


X 2 X 3 


Ci ( 1 — x + 6~ + 


+ C?x 1 


, 3x 2 x 3 

1+ --T + 


Ce x + C 2 x l e x 9 where C = C x — C 2 . 

29x 2 


L 


+ C 


_ , / 1 x 29x 2 \ 1 

{* (“2“2 + H4" + -J+JJ’.MtaW • 


where j^x) denotes the solution of which C l is the coefficient. 


10 


+ c { x ‘4^T-y + ") + 5 ),,(x|ln|)t| } 

where 34 (x) denotes the solution of which C x is the coefficient. 


oo 

:.i+I 

n = 1 


( — l)"2"x" 

( n !) 2 


+ C 


, 4x — 


3x 2 + 


22x 3 


+ J»iM In |x| 


} 


where ^(x) denotes the solution of which C x is the coefficient. 

H4£ p- 4- ££)]» ] 


+ c 


V _ 

L 4 


3x 5 llx 7 
-J~ A A -j- 


+ yi(x) In | x | 




128 13824 

where ^(x) denotes the solution of which C x is the coefficient. 


c x sin x + c 2 cos x 



ce 


-It 2 

, y = -ice 


2t + - |e ( . 


ce 


- 3 / 



2ce 3t 



e x 

2' 


3 
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5. x = c l sin t + c 2 cos t, 

/3 c t + c 2 \ . fey - 3 c 2 
v = - -x sin t + — - r ~ 


cos t + — — — 
2 2 


7. x = c l e' /E ‘ + c 2 e-' / *‘ -t+i, 


i t 1 


6 6 3 


^ , sin t c x e l sin t 

9. x-c,e — 

11. x = CiC 4 ' + c 2 e~ 2 ' - t + 1, y = - c t e 4t + c 2 e~ 2 ' + t. 

13. x = Ci + c 2 c -2 ' + It 2 + t, y = (1 — c t ) — 3 c 2 e -2 ' — t 2 — 3 1. 
15. x = c t e‘ + c 2 c -2 ' — te‘; 

y = (3 - CiV ~ ic 2 e~ 2 ‘ + te‘. 


17. x = c l e 3 ' + ---,y = c 2 e‘ - 


t 5 c l e 3t t 4 


19. x = Ci + c 2 t + c 3 e‘ + c 4 c ' — 


y = (Ci — c 2 + 1) + (c 2 + l)f + c 4 e ' — 


21. x = Ci + c 2 e' + c 3 e 3t — 


t 2 14 1 
IT 9~’ 


/ 17\ , _ 3 , t 2 4t 

y = ( 3Ci + Yj + c 2« - 3c 3 c - y - y- 

23. 


— - = -2x l + 3x 2 + t 2 . 
at 


2, 


- = -2t 3 x 2 — tx 3 + 5t 4 . 


Section 7.2 

1. = 2 cos t — cos 2t, x 2 = 4 cos t + cos 2t. 


3. «i = 


10c _ 10001 5e- 4000 ' 

3 3 


+ 5, i 2 = 


10c" 10001 5c- 4000 ' 5 

1 1 - — . 

3 6 2 


5. x = c _,/1 ° + 2c"' /30 , y = —2e~ tll ° + 4c ‘ 
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Section 7.3 

1. (c) x = 2e 5t — e~\ y = e 5t + e ~ x . 

Section 7.4 

1. x = c t e x + c 2 e 3t , y = 2c t e x + c 2 e 3t . 

3. x = 2c l e* t + c 2 e~ t , y = 3 c^ 4 ' — c 2 e~ l . 

5. x = c^e 1 + c 2 e 5t , y = — 2c 1 e f + 2c 2 e 5 '. 

7. x = 2 + c 2 e - *, y = + c 2 e“'. 

9. x = c^e 4 ' + c 2 e~ 2t , y = c t e 4t - c 2 e~ 2t . 

11. x = 2 e t { — c l sin 2 1 + c 2 cos 2f), y = £*(£! cos 2 1 + c 2 sin 2r). 

13. x = cos 3 1 + c 2 sin 3 1), y = e t (c l sin 3 1 — c 2 cos 3 1). 

15. x = 2 e 3t (c t cos 3 1 + c 2 sin 3 1), 

y = e 3 '[c 1 (cos3f + 3 sin 3 1) + c 2 (sin3f — 3cos3f)]. 

17. x = e 3t (c t cos 2 1 + c 2 sin 2f), y f= e 3 '(Ci sin 2r — c 2 cos 2t). 
19. x = c^e 1 + c 2 te x , y = 2c^ f + c 2 (2t — l)e f . 

21. x = — 2c i e 3t + c 2 (2t + l)e 3 *, y = c t e 3t — c 2 te 3t . 

23. x = 2e 5t -f le~ 5t , y = 2c 5f — 3e“ 5 *. 

25. x = 4c 4f [cos 2f — 2 sin 2t], y = e 4 '[cos 2f + 3 sin 2*]. 

27. x = 2e 4 ' - 8 te 4r , y = 3e 4 ' - 4te 4r . 

31. x = 3c^ 4 + c 2 f -1 , y = 2c t f 4 - c 2 t ~ l . 


Section 7.5A 

/9 0 9\ 

1. (b) 1 4 2. 


/ -4 12 — 20\ 

2. (b) ( —24 8 0 . 



3c 3 ' \ 

6. (b) (i) (6t+U)e 3t ; 

^(3t 2 + 2 t)e 3t J 


/ (e 3 ‘ - l)/3 

(ii) [e 3t (6t + 7) - 7]/9 
\[^ 3f (9f 2 - 6t + 2) - 2]/27 
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Section 7.5 B 
1. AB = 

3. AB = 

5. AB = 

7. AB = 

9. AB = 

11. A 2 = 

13. A -1 = 
15. A -1 : 

17. A~ 1 = 

19. A“ 1 : 

21. A~ 1 = 

23. A -1 = 

Section 7.5C 

3. (a) k 


/ 22 
\18 

23\ 

13/ 

BA = 

/ 18 62\ 
V 7 17/ 

/ 7 


4 

0\ 


-7 


8 

-14 

I, BA = ( 

\ 17 


-4 

16/ 


/ 7 





9 

1 


BA not defined. 

\io 

4 / 





' 19 
-20 


2 ' 
12 ; 


I 42 

14 



1 

7 

5\ 

34 

15 

9 

, BA = | 

-8 

4 

8 

l 6 

-4 

~ 2 I 


\ 6 

36 

50/ 


3 5\ 

— 4 8 , BA not defined. 

0 -a) 


4 

4 ^ 


fl ~ 6 

2l\ 

3 

8 

, A 3 = 

12 1 

-3 

4 

- 1 / 


^ 6 12 

V 



( 1/16 — 1/8\ 

V — 3/8 -1/4/ 

1-1/5 2/5 -3/5\ 
1-1 1 . 
— 6/ 5 7/5 -3/5/ 

1—2 19/3 l/3\ 

0 -2/3 1/3 . 

\ 1 -7/3 -1/3 / 

/ 4 1 -6\ 

1/2 1/2 -1 . 

\— 3 "I 5/ 

1 -1/5 2/5\ 

-2 1 -1 , 

0 -1/5 2/5) 


= 3, (b) k = 2. 
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Section 7.5 D 

1. Characteristic values: — 1 and 4; 

Respective corresponding characteristic vectors: 



where in each vector k is an arbitrary nonzero real number. 
3. Characteristic values: — 1 and 6; 

Respective corresponding characteristic vectors: 



where in each vector k is an arbitrary nonzero real number. 
5. Characteristic values: 7 and — 2; 

Respective corresponding characteristic vectors: 



where in each vector k is an arbitrary nonzero real number. 
7. Characteristic values: 1, 2, and —3; 

Respective corresponding characteristic vectors: 



where in each vector k is an arbitrary nonzero real number. 
9. Characteristic values: 2, 3, and — 2; 

Respective corresponding characteristic vectors: 



where in each vector k is an arbitrary nonzero real number. 
1 1. Characteristic values: 1, 3, and 4; 

Respective corresponding characteristic vectors: 



where in each vector k is an arbitrary nonzero real number. 
13. Characteristic values: —1,2, and 3; 

Respective corresponding characteristic vectors: 



where in each vector k is an arbitrary nonzero real number. 



ANSWERS TO ODD-NUMBERED EXERCISES 791 


Section 7.6 

1. x t = c^e' + c 2 e 3 \ x 2 = Ic^e 1 + c 2 e 3t . 

3. x t = 2 c t e 4t + c 2 e~\ x 2 = 3 c^ 4 ' — c 2 e~\ 

5. x t = c^e 1 + c 2 e 5 \ x 2 = - 2c x e l + 2 c 2 e 5t . 

7. X! = c t e 4t + c 2 e~ 2 \ x 2 = c t e 4t - c 2 e~ 2t . 

9. x t = 2 e t ( — c l sin 2 1 + c 2 cos 2 1), x 2 = e t (c l cos 2r + c 2 sin 2t)- 

11. Xi = e t (c l cos 3 1 + c 2 sin 3t), x 2 = e t (c 1 sin 3 1 — c 2 cos 3 1). 

13. X! = c^e 1 + c 2 te\ x 2 = 2c^ f + c 2 {2t - l)e*. 

15. X! = -2c^ 3t + c 2 (2t + l)e 3 \ x 2 = c t e 3t - c 2 te 3t . 


17. Xi = 2c t e 4t + c 2 (2f + l)e 4 ', x 2 = c x e M + c 2 te 4t . 


Section 7.7 

1. x t = c^e 1 + c 2 e 2t + c 3 e~ 3t 9 
x 2 = c Y e l + 2 c 2 e 2t + lc 3 e~ 3t 9 
x 3 = 4- + llc 3 e~ 3t . 

3. x x = + c 2 e 3t + c 3 e~ 2 \ 

T# — 2f 

£3 £? 

x 3 = — Cje 2 ' — c 2 e 3 ' + 4c 3 e -2 '. 

5. x x = c^ 5 ' + 2c 2 e~\ 

x 2 = Cie 5 ' - c 2 e~' + c 3 e~ 3 ', 
x 3 = Cie s ‘ — c 2 e~‘ — c 3 e -3 '. 

7. x, = -2c 1 e <2+ ' /3)t + 2c 2 e <2 “' /3) ', 

x 2 = (l +y5)c 1 e <2+ ' /3) ‘ + (-l +75)c 2 e (2 -' /3) ', 
x 3 = c 3 e 2 '. 

9. x t = c t e~' + c 3 e 3t , 
x 2 = 2c l e~ t + c 2 e 3 ', 
x 3 = — 2c l e~‘ — c 3 e 3t . 

11. Xi = c y e 4t + c 2 e', 

x 2 = 2c l e* t + 3 c 2 e‘ + 3 c 3 e‘, 
x 3 = c t e 4t + c 2 e‘ + c 3 e‘. 

13. x t = c t e~‘ + 2c 2 e 2 ', 
x 2 = c t e~‘ + 3c 3 e 2t , 
x 3 = — c v e~} — c 2 e 2t — c 3 e 2 '. 

15. Xj = c 2 e l + 2c 3 e 4 ', 
x 2 = c 2 e 3t — c 3 e 4t , 
x 3 = c x e l + 2 c 2 e 3 ' + c 3 e 4 ‘. 

17. x t = c t e‘ + c 3 e~ 2 ', 



x 2 

= —20^ + c 2 e 2i 

■ + C 3 (J 

- \)e- 2 \ 


*3 

ro 

1 

<N 

1 

(N 

0 

1 

II 

- 2t 


19. 

Xi 

= Ci€~* + c 3 e 3 '. 




x 2 

— -2c l e~‘ + c 2 e 3 ' 

+ c 3 (t 

- l)e 3 ', 


*3 

— —c 2 e 3t — c 3 te 3 '. 



21. 

*1 

= c^e 2 ' + c 3 te 2 ‘. 




x 2 

= c 2 e 2 ' — 2 c 3 te 2t . 




*3 

= —2 c t e 2t — 3 c 2 e 2t + c 3 (4t + l)e 2t . 

23. 

Xl 

= 2c 1 e 2t + 2 c 2 te 2t 

+ c 3 (t 2 

+ l)e 2 ', 


x 2 

= —c t e 2t — c 2 te 2t 

-ic 3 t 2 

e 2 '. 


*3 

= -c 2 e 2t - c 3 (t - 

3)e 2 ‘. 
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Section 8.2 

1. y=l + x + 2Y^-=— x — 1 + 2e x . 
n= 2 n\ 

, . . 2 4x 3 9x 4 

3. y = 2 + x + 2x 2 + — + — + •••. 


X X X X J 

5 ' > = ~2 + 6 _ + M + l20 + '"' 


7 , ,, x + x i + --- + .... 

n , .7 7(x — l) 2 14(x — l) 3 73(x — l) 4 

9. y = 1 + 2(x - 1) + 2 + 3 + 12 + 

„ (x - l) 2 (x - l) 5 

11. y=n+-^— + - 4Q- + -- 


Section 8.3 

v 2 y 2 y 4 y 2 y 4 y 6 

1. 0,(x)=l+y,^(x)=l+y + y,0 3 (x)=l+ T + T + — . 

Y 2 Y 2 Y 2 y ® y ^ ^ 

3. ♦,M- T .* J M- T + s .* J (x) = y + ^ + ^ + ij55 . 

e 2 * 1 

5. <£,(x) = e x - l,<^ 2 (x) = — -e* + x + -, 


e 4x e 3x xe 2x e 2 * 


x 3 x 2 x 107 


^ 3(X) = 76 ~T + ~T~ + T~ 2xe * + 2e * + T + T + 4~ 48 


.7 y 7 Y 1 2 ^ v 1 7 v 22 

z*2 / v\ v2 _l A\ l v\ — v-2 


7. 0i(x) = X 2 , 0 2 M = X 2 + — , 03 (x) = X 2 + y + — + 


+ 


833 7546 


Section 8.4B 

1. (a) 0.800, 0.650, 0.540, 0.462. (b) 0.812, 0.670, 0.564, 0.488. 
(c) 0.823,0.688,0.586,0.512. 


Section 8.4C 

1. (a) 1.239, 1.564, 1.995. (b) 1.238, 1.562, 1.991. (c) 1.237, 1.561, 1.989. 
3. (a) 1.112, 1.255, 1.445. (b) 1.100, 1.222, 1.375. 


Section 8.4D 

1. (a) 1.2374, 1.5606, 1.9886. (b) 1.2374, 1.5606, 1.9887. 
3. 1.1115,1.2531,1.4398. 


Section 8.4E 

1. 2.5447. 
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Section 9.1 A 


1 . 


2 


3. - (1 — e~ 2s ). 
s 



Section 9.1 B 

s 2 + 2 a 2 


1 . 


s(s 2 + 4a 2 )’ 

3. if {sin 3 at} = 


6a 3 


if {sin 2 at cos at} = 


(s 2 + a 2 )(s 2 + 9a 2 )’ 

2a 2 s 


(s 2 + a 2 )(s 2 + 9a 2 )’ 


7. 

9. 

11 . 

13. 


s + 5 


s 2 + 3s + 2’ 

2 

(s - a) 3 ' 

2s(s 2 - 3b 2 ) 
(s 2 + 6 2 ) 3 ' 

6 


(s - a) 4 


Section 9.1 C 

5e" 6s 
1 . — — . 
s 

3. -(l-e~ 6s ). 
s 

5. ~(e- 5s - e~ 7s ). 
s 

1 +e~ 2i + e~ 4i -3e~ 6s 

s 

2(1 -c~ 3s + e~ 6s ) 
s 
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Section 9.2A 


2 sin 3 1 

* ~~Y~‘ ’ 

, 5 t 3 e 2 ' 

1 — 

5. e~ 2 ' cos yfet. 


n 4f sin 2t + 3 sin 2f — 6f cos 2f 
' 16 _ 

11 . 

11 

15. — 4e _2 ‘ + 5e -31 . 

17. 7 cos 3t + 4 sin 3t. 


19. 


/(0 = 


(0, 0 < t < n, 

| — 5 cos 3t - 2 sin 3t, t > n. 


21 . 

23. 


m= 


JO, 0 < t < rc/2, 

\e~ 2(t ~ Kl2) \l cos 3t - sin 3 1], t > nil. 


m = 


0, 0 < t < 4, 
t - 4, 4 < t < 7, 
3, f > 7. 


25. 


/« = 


j sin 2t, 0 < t < n. 
sin 2 1, t > n. 


27. 


f(t) = 


e‘ sin 2t, 0 < t < n/2, 

(1 — e~* l2) )e' sin 2t, t > rc/2. 


Section 9.2B 

1 . e~ 2 ‘ — e~ 3( . 
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3. (1 — cos 3t)/9. 

5. (-1 +3; + <r 3 ‘)/9. 

Section 9.3 

« 3e x + e 3x 


3. y = e 2f 

5. y = 2 — 2 cos 2t + 3 sin 2 1. 

7. y = 2e 2 * — 3^ _r - e _r sin 3 1 + e~ x cos 3 1. 

9. y = e~ 2t — e~ 3x — 2 te~ 3i . 

11. y = (3 — 4 t)e x + sin t — 3 cos t. 

13. y = — 2e x + e lx + 1, 0 < t < 4; 

y = 2(e~ 4 — l)e x + (1 — e~ 8 )e 2t , t > 4. 

15. y = ^[1 + e~ 2x (3 sin t — cos t)], 0 < t < tc/2, 

e~ 2t 

y = -y- [(e* + 3)sin t — (2e* + l)cos t], t > tc/2. 

17. )/ = — t + 271 + | sin 2 t + (2 — 2tt)cos 2 t , 0 < t < 2 n , 

y = (2 — 2tc)cos 2t, t > 2n. 

Section 9.4 

i e * e ~ l ^ It e ' e ~ l 2 1 

• x= ~2 + ~T + 1 ,y= 2 + ~T~ ' 

3. — 2e‘ + 5e*‘, y = —4e' + 4e 4 '. 

5. x = 3 + 2 £ + f r 3 , y = 5 + 5t - 2t 2 + f t 3 . 

7. x = 8 sin t + 2 cos t, 

e x e~ x 

y = — 13 sin t + cos t + — — — . 

2 2 

9. x = — te*, )/ = — ^e -2 ' + te x . 

Section 10.2 

1. (a) There exists a solution </> on some interval |x — x 0 | < h about x = x 0 
(b) There exists a unique solution on |x — x 0 | < h. 

1 n 

3. (a) |x| <-; y = tan x, |x| < — . 

, , , 1 ln(l - 2x) 1 

(b) |x|<— ;y= 2 ’ -°° <x< 2 ' 

Section 10.4 

1. (a) Yes. (b ) d Jy, y , d Jl-, > , d J±-x 2 + y t y 2+ yl 


3. Theorem 10.8 
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Section 11.2 

1. Yes. 
3. No. 
5. Yes. 


Section 1 1 .3 

1. See Example 1 1.10 in the text. 

3. (b) x x = (1 l/4)e 3 ' + (9/4)<T 3 ', x 2 = (55/4)e 3 ' - (27/4)<?‘ 
5. (b) Xi = 6e\ x 2 = 6e l — 3e 5 ',x 3 = 3e' — e 51 . 

7. (c) B = CAC -1 . 


Section 1 1 .4 

1. (a) 

3. (a) 
5. (a) 
7. 


e 3t 

3e 4 '\ 

e 3t 

2c 4 '/’ 

e~ 

' 2e 4( \ 

— e~ 

' 3e M ) 

' 

e-\ 

— 2e 

' 2e 5 '/ 


(b) 



(b) 

(b) 


f e‘ + le 2t \ 

\ — 3e' + e 2, J 

COS t 

sin t — 3 cos t 


-6e~ t + 3e~ 2t -h4e~ 3t ^ 
—9e~‘ + 4e~ 2 ‘ + 3e~ 3t * 


Section 1 1 .5 


e sin t cos t 


1. (a)/ t «- T - 2 • 2 • 

f 2 (t) = sin t, 

r ^ e l sin t cos t 
hv) = ~2 2 2~* 

(b) + 7/2(0 + 2/3(0. 

d 2 y 

W 2 


9. ^4 - y = 0; 3; = e\ u = e x = exp( - 


t 2 -t 


x — expl — 


I 3 + 3t 


Section 1 1 .6 

5. (a) x = f 3 . (b) x = c t t 3 + c 2 te'. 


Section 11.8 


d 2 x dx 


1. (a) t 2 2 "h t “7 — h 2x = 0. 
' dt 2 dt 


3t 
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(b) (2 1 + 1) -jjY + (4 — £ 3 ) — + (1 - 3t 2 )x = 0. 

( C ) ^2" “ + ^)X = 0- 

(d) £ 3 t 4 + (* 3 + 8f 2 - 0 T" + (4* 2 + 1 It - 2)x = 0. 

at at 

5. (b) u = —t, x = e~\ x = c l e~ t + c 2 e~ t f e 2t t~ l dt. 


Section 1 2.1 

1. X - 4 n 2 ( n = 1, 2, 3 ,...); y = c n sin 2nx ( n = 1, 2, 3,...)* 

3- A = (n = 1,2,3,.. .); y = c„ sin ^ (n = 1, 2, 3, . . .). 

5. X = a 2 , where a„ (n = 1, 2, 3, . . .) are the positive roots of the equation 
a = tan rca; y = c n sin a n x (n = 1, 2, 3,...). 

7. X = n 2 (n = 1, 2, 3,...); y = c„ sin(n ln|x|) (n = 1, 2, 3,...). 

9. X = 16n 2 (n = 1/2, 3,...); y = c n sin(4n arctan x) (n = 1,2,3,...). 


Section 1 2.3 


3. if i^<zi): sin( „ lnW) ,if 

n n = 1 n n n =i In - 1 
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,, 1 1 » fl -(-1)"] . 

2 + J.S,[ n J S ” 

,3. L + Lf I ' . ' 

2 n n ^i L n J 


. nnx 1 2 J2, 1 . (2 n — l)nx 

sm— = - + - £ x r sin . 

3 2 n n = i 2n — 1 3 


. nnx L 2L ® 1 . (2 n — 1 )ttx 

sin-—- = 77 + — > -sin . 

Lin n = i 2n — 1 L 


, 2aL « (-1)" +1 . rax 

15. b + Y ^ sin . 

n n = i n L 


Section 1 2.4C 


1. (a)- 


2 - fl -(-1)"1 . 

- I si 

n n = i n j 


4 ^ sin(2n - l)x ( 

sin nx = - ) — — . (b) 1. 

n n ^ 2n-\ w 


^ / 4 * cos(wc/2) — cos nn . 

3. (a) - Y — — — sin nx. 

n „ = i n 

n v , 4 JS, sin(nn/2) 4 ® (-1)” ^ 

(b) 1 y cos nx = 1 + - y cos(2n — l)x. 

7C „ = i n n „=i 2n — 1 


„ . z. *+ > Lua 

5. (a) sin x. (b) - - - £ -j-j 


2 4 * cos 2nx 

7i 7t „ = i 4n 2 — 1 ' 


_ . 4 L *; 1 -(-1) . U7CX 8L 1 

7. (a) — > sin — — = — V sir 

it » = i n L n „ = i 2 n — 1 


(2n - l)7tx 


(b) 2L. 

„ , 2 L 2 i cos(>ra/2) — cos wt . nnx 

9. a) X — s,n ~r- 

n „ = i n L 

, L 2 2L 2 sin(n7t/2) nrtx 

(b) -r Y — — — ■ cos — “ — 

2 n n = l n L 

L 2 2 L 2 * (-1)" (2n — l)7tx 

= T + — , 5,2^1 cos —i—' 


, 4 L 2 £ ri -<-1)0 . nn> 

l 2 2 l 2 i n+t-iri 

,b) 


. nnx 8 L 2 JS, 1 . (2n — l)nx 

sin — — = — t~ > — iTT sin = . 

L 7t 3 (2 n - l) 3 L 


nnx L 2 L 2 ® 1 2ttrcx 

cos — - = — j L — cos — — . 

L 6 tt n = i n L 


Section 12.4D 

1. At x = 0, series converges to at x = njl, to (3n + 2)/4; and at x = n, to n. 


Section 13.1 

1. (a) (0, 0), (1, 1); (b) ^ = * _ y 2 ; (c) x 3 - 3 xy + y 3 = c. 

3. (a) x = e',y = 3e' + te‘; (b) x = e'~ 3 , y = te'~ 3 ; 

(c) The equation of the common path is y = x(ln |x| + 3); 

(d) The family of paths is y = x(ln |x| + c). 



ANSWERS TO ODD-NUMBERED EXERCISES 799 


Section 13.2 

1. Saddle point, unstable. 

3. Saddle point, unstable. 

5. Center, stable. 

7. Node, unstable. 

9. (a) Saddle point; (b) x = c^e 21 + c 2 e~ 2 \ y = c^e 2t — 3 c 2 e~ 2t . 

(c) The general solution is 3x 2 - 2xy — y 2 = c. 

Section 13.3 

1. Node, unstable. 

3. Saddle point, unstable. 

5. Saddle point, unstable. 

7. (a) Saddle point, (b) y 2 = 2(x + e~ x + c). (d) x 2 — y 2 = c. 

9. (a) Critical points are (- 1, 0), (0, 0), (1, 0), (2, 0), and (3, 0). 

(d) (0, 0) and (2, 0) are centers; the other three are saddle points. 

11. (a) Critical points are (- 1, 0), (0, 0), (1, 0), and (2, 0). 

(d) (0, 0) is a center; (—1, 0) and (2, 0) are saddle points; (1, 0) is a “cusp.” 

13. Saddle point if a < — 12; node if — 12 < a < 4; spiral point if a > 4. 

15. (a) (0, 0), saddle point, unstable; (f, — ^), spiral point, unstable. 

(b) (0, 0), node, asymptotically stable; (2, 6), saddle point, unstable; (3, 12), 
node, unstable. 

17. (a) X 0 = 9. 

(b) Center at (1, 0); saddle point at (5, 0). 

(c) “Cusp” at (3, 0). 

19. (a) E(x , y) = x 2 + y 2 \ asymptotically stable. 

(c) E(x , y) = x 2 + y 2 ; asymptotically stable. 


Section 1 3.4 


,a) s =r, 4 - r 2 ) 'f' 4 ' 


. . 2 cos(4t - t 0 ) 2 sin(4t - t 0 ) 

(c) x = — . - -, v = — 

yj 1 + ce 8r yj\ + ce 8t 

x = 2 cos(4t - t 0 ), y = 2 sin(4t - t 0 ). 

3. Equation has periodic solutions. 

5. Equation has no nontrivial periodic solutions. 
7. Equation has no nontrivial periodic solutions. 


Section 13.5 

1. x = a 0 sin 

3. x = 


5 jiao + 16 co : 
16 o 


+ 0 O |. 


2 a 0 . r In 

= —====== sin cot + — 

y/2o\\i(o 2 t + 4 L 

— • l~ 3(Iq€ 


2^-ni 


(3alfico 2 t + 4) 
2co 3 

. 1 


+ 
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Section 14.1 

1. u = f(y + x) + g{y + 6x); hyperbolic. 

3. u = f(y - 3x) + xg(y — 3x); parabolic. 

5. u = f(y + ix ) + g(y - ix); elliptic. 

7. u = f (y) + g(y + 2x); hyperbolic. 

9. u =f(y - ^ + g(y + hyperbohc. 

11. (b) Hyperbolic at all (x, y) such that y < x 2 /4. 


scuiun 


LIVI ■ I 

1. y(x, t) = sin 2x cos 2t. 

® . nnx 2nnt 

3. y(x, t) = 2. sin —— cos where 
w=i 3 3 

12 . nn nn 

b„ = -j-t sm — cos — (n = 1, 2, 3,...) 
irrr 2 6 

= bj2>ln 2 n 2 (n = 1, 7, 13,...); -6^/3/n 2 n 2 (n = 5, 11, 17,...); 
0 (n even or n = 3, 9, 1 5, ... ). 

5. n(x, t) = £ sin e ~n 2 n*t/L 2 , w h ere 

2 




H7CX 

/(x)sin-^— dx (n = 1, 2, 3,...). 


00 

7. u(x, y)= £ a„ sin nx sinh n(y - n), where 

n = 1 


a " 7t sinh /i7c J o 


/(x) sin nx dx (n = 1,2,3, 


Section 14.3 

d 2 u 


1. 

3. 

5. 

7. 


crj 

d 2 u 
c£> crj 

£~ 


+ 


= 0. 


£u 

1 Su 

£q 

~2 fri 

c u 

cu 

ct] 2 - 

dl 


cu 


1 ( cu cu ) 

cq £rj 6 \c£ + £rj) 

P2„ p2„ 2 


^ c 2 u £ 2 u 2 

9. TTo "h “ — 2 = — A 
cc ci/ 9 


Section 14.4 

3. y = 2x + c 1? y = — 4x + c 2 . 
5. No real characteristics exist. 
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Abel-Liouville formula: 
for linear systems, 528 
for «th order linear equations, 558 
Abel’s formula, 578 
Adjoint equation, 573 
Amplitude, 183 
Analytic function, 222 
Applications: 

to electric circuit problems, 211, 282 
to geometric problems, 70 
to mechanics problems, 77, 179, 278, 655, 668, 
674, 724 

to mixture problems, 94, 285 
to population growth, 91 
to rate problems, 89 
Approach, of path to critical point, 637 
Approximate methods, first-order equations: 
graphical, 377 
numerical, 394 
power series, 384 
successive approximations, 390 
Asymptotic stability of critical point, 643, 653, 
661, 682, 686 
Autonomous system, 633 
Auxiliary equation, 1 26 

Beats, 205 (Exercise 9) 

Bendixson’s nonexistence criterion, 695 
Bernoulli differential equation, 54 
Bessel functions: 


jf first kind, order one, 259 
of first kind, order p, 259 
of first kind, order zero, 254 
of second kind, order p, 262 
of second kind, order zero, 256 
Bessel’s differential equation: 
applications to partial differential equations, 737 
of order p, 252, 256, 586 (Exercise 11) 
of order zero, 253 
Boundary- value problem, 16, 588 

Canonical forms, of partial differential, equations, 
743, 746, 750, 753, 756 
Cauchy- Euler differential equation, 164 
Center, 638, 652, 660 
Characteristic equation, 301, 340, 360, 645 
Characteristic function: 
definition of, 592 
existence of, 595 
expansion in series of, 603 
orthogonality of, 598 
orthonormal system of, 602 
Characteristic roots, 301 
Characteristics, of partial differential equations, 
761, 764 

Characteristic values: 
of matrix, 337, 340, 360 
of Sturm- Liouville boundary- value problem, 592 
Characteristic vectors, 337, 340 
Clairaut’ s equation, 68 (Exercise 20) 
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Companion vector equation, 545 
Comparison theorem of Sturm, 581 
Complementary function, 121 
Continuation of solutions, 484 
Continuing methods, 395, 407 
Convergence: 
of sequence, 461, 464 
of series, uniform, 468 
of trigonometric Fourier series, 624 
Convolution, 437 
Critical damping, 191 
Critical point 
approached by path, 637 
criteria for determination of type of, 645, 654 
(Table 13.1), 660 
definition of, 634 
entered by path, 637 
index of, 700 
isolated, 637 
of linear system, 644 
of nonlinear system, 658 
stability of, 642, 653, 661, 682, 684 
types of, 638 

Damped oscillatory motion, 1 89 
Damping, 187 
Damping factor, 190 
Damping force, 181 
Dependence of solutions: 
on initial conditions, 488 
on a parameter, 678 
Determinants, fundamentals of, 771 
Differential equation: 
analytic significance of, 1 1 
definition, 3 

geometric significance of, 1 1 
Differential operator, 268 
Direction field, 378 
Domain, 469 

Entering, of path to critical point, 637 

Equidimensional differential equation, 164 

Euler method, 395 

Euler’s constant, 256 

Even function, 611 

Exact differential, 27 

Exact differential equation, 27 

Existence theorems: 

for characteristic values and functions of 
Sturm-Liouville Problems, 595 
for closed paths of nonlinear systems, 698 
for first-order initial- value problems, 19, 475 
for Laplace transforms, 416 
for linear differential equation with given 
fundamental set, 563 
for linearly independent solutions: 
linear systems, 295, 357, 361 
wth-order linear equations, 109, 553 
second-order linear equations, 1 12, 223, 240 


for linear systems, initial-value problems, 292, 
500, 509 

for nonlinear systems, initial-value problems, 496 
for «th- order linear equations, initial-value 
problems, 103, 501, 543 
for /jth-order nonlinear equations, initial- value 
problems, 498 

for periodic solutions of nonlinear equations, 703 
Expansion in series of orthonormal functions, 603 
Exponential order, functions of, 415 
Exponents of differential equation, 237 

Focal point, 640 
Forced motion, 199 
Fourier coefficients, 608, 611 
Fourier cosine series, 620 
Fourier series, relative to orthonormal system, 608 
Fourier series, trigonometric: 
convergence, 624 
definition of, 61 1 
of even function, 611 
examples illustrating, 613 
of odd function, 612 
Fourier sine series: 

applied to partial differential equations, 729, 735 
definition of, 619 
Free damped motion, 189 
Free undamped motion, 182 
Frequency, natural, 184 
Frobenius, method of, 236 
Fundamental matrix, 517 
Fundamental set of solutions, 110, 517, 553 

Gamma function, 258 
Graphical methods, 377 

Half-path, 697 

Heat equation, 721, 742 (Exercise 5) 
Homogeneous differential equation: 
first- order, 42 

linear, higher, 102, 106, 125, 543, 558 
linear partial of second-order, 717 
linear systems, 290, 292, 505, 510 
Homogeneous function of degree n, 43 
Hooke’s law, 180 

Improper integrals, 416 
Index: 

of critical point, 700 
of curve, 700 
Indicial equation, 237 
Initial- valve problem: 

for first-order ordinary differential equations, 

17, 473 

for linear partial differential equations, 757, 759 
for a th- order linear ordinary differential 
equations, 1, 3, 501 

for rtth- order ordinary differential equations, 498 
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for systems of linear ordinary differential 
equations, 292, 500, 510 
for systems of nonlinear ordinary differential 
equations, 496 
Iritegral curve, 11, 377 
Integral equation, 474 
Integrating factor, 
definition of, 35 

for first-order linear differential equations, 50 
methods of finding, 61 
special types of, 63 
Inverse Laplace transform, 431 
Irregular singular point, 234 
Isocline, 380 

Kirchoff s laws, 212, 282 

Kryloff and Bogoliuboff, method of, 707 

Laplace differential equation, 721, 731, 742 
(Exercise 7) 

Laplace transform: 
convolution, 437 
definition of, 41 1 
of derivatives, 419 
existence of, 416 
inverse transform, 431 
linear property, 418 
of periodic functions, 428 
properties of, 418 

solution of linear differential equations by, 441 
solution of systems of linear differential 
equations by, 453 
of step functions, 424 
table of, 434 

of translated functions, 426 
translation property, 421 

Legendre’s differential equation, 233 (Exercise 18), 
575 

Liapunov function, 684 
Liapunov’s direct method, 682 
Lienard- Levinson- Smith Theorem, 703 
Limit cycle, 692 
Limit set, of half-path, 698 
Linear algebraic systems, theorems on, 171, 332, 
559 

Linear combination, 106, 293, 316 
Linear dependence: 
of constant vectors, 332 
of functions, 107 
of solutions, 294, 549 
of vector functions, 334 
Linear independence: 
of constant vectors, 333 
of functions, 108 
of solutions, 109, 294, 549 
of vector functions, 334 
Linear ordinary differential equations: 
first-order, 49 
higher-order: 


companion vector equation of, 545 
constant coefficients, 125, 441 
definition of, 5, 102 
existence theorems for, 103, 501, 543 
general solution of, 110, 122, 501 
homogeneous, 102, 106, 125, 510, 522, 543 
Laplace transform solution of, 441 
nonhomogeneous, 120, 137, 155, 569 
reduction of order, 116, 565 . 
relation to linear system, 267, 544 
theory of, 102, 543,558, 569 
second-order, 
applications of, 179 
general, 103, 107, 112, 116, 221 
self-adjoint, 574 
Sturm theory, 577 
theory of, 170 

Linear partial differential equation, 717 
Line element, 378 
Lipschitz condition, 470 
Logarithmic decrement, 190 

Matrix: 
adjoint of, 327 

characteristic equation of, 340, 347 
characteristic values of, 337, 340, 347 
characteristic vectors of, 337, 340 
coefficient matrix, 347, 356 
cofactor of an element of, 326 
definitions, basic, 312-315 
derivative of, 329 
fundamental matrix, 517 
identity matrix, 324 
inverse of, 325 
minor of an element of, 326 
multiplication of, 322 
nonsingular, 325 
product of, by vector, 317 
symmetric matrix, 341 
transpose of, 327 
Method: 
of Euler, 395 
of Euler, modified, 399 
of Frobenius, 236 
of isoclines, 380 
of Kryloff and Bogoliuboff, 707 
of Liapunov, direct, 682 
of Milne, 407 
of Runge-Kutta, 403 
of separation of variables, 722 
of successive approximations, 390 
of power series, 384 
of Taylor series, 384 
of undetermined coefficients, 137, 386 
of variation of parameters, 155, 537, 570 
Milne, method of, 407 
Modified Euler method, 399 

Negative definite function, 682 
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Negative semidefinite function, 682 
Newton’s second law, 77, 182 
Node, 641, 645, 648, 660 
Nonhomogeneous differential equation, 

higher-order linear, 102, 120, 137, 155, 569 
Nonlinear differential equation, 5, 632 
Normalized function, 601 
Normalized linear differential equation, 543, 563 
Numerical methods, 394 

Oblique trajectories, 74 
Odd function, 611 

Operator method for linear systems, 270 
Order, of a differential equation, 5 
Ordinary point, 222 
Orthogonal functions, 597 
Orthogonal trajectory, 70 
Orthonormal system, 601 
Overcritical damping, 1 92 

Partial differential equations: 
canonical forms of, 743, 746, 750, 753, 756 
characteristics of, 761, 764 
classification of, 720 
elliptic, 720, 753 
homogeneous linear, 717 
hyperbolic, 720, 746 
infinite series of solutions of, 7 1 7 
initial- value problem for, 757, 759 
parabolic, 720, 750 
separation of variables in, 722 
Particular integral: 
definition of, 121 

methods of finding, 137, 155, 570 
superposition principle for, 572 
Path, 633 

Pendulum problem, 187 (Exercise 8), 674 

Periodic function, 624 

Periodic motion, 183 

Periodic solutions, 692, 703 

Period of motion, 184, 190 

Phase constant, 1 84 

Phase plane, 633 

Picard method of successive approximations, 390 
Piecewise continuous function, 414 
Piecewise smooth function, 625 
Plucked string problem, 730 
Poincare-Bendixson theorem, 698 
Pointwise convergence, 462 
Positive definite function, 682 
Positive semidefinite function, 682 
Power series solutions, 221, 384 

Recurrence formula, 228, 239 
Reduction of order, 116, 565 
Regular singular point, 324 
Resonance, 206 
Restoring force, 180 


Riccati equation, 59 (Exercise 38), 584 
(Exercise 5) 

Runge-Kutta method, 403 

Saddle point, 640, 646, 660 
Self-adjoint equation, 574 
Separable differential equation, 39 
Separation of variables, method of, 722 
Separation theorem, of Sturm, 580 
Separatrix, 674 
Series solutions: 

of first-order differential equations, 384 
about ordinary point, 221, 223 
about regular singular point, 233, 235, 240 
of second-order linear differential equations, 221 
Simple harmonic motion, 183 
Singular point, 222 
Solution: 
definition, 8 
explicit, 8 

of first-order initial- value problem, 17, 473 
formal, 10 

fundamental set of, 1 10, 517, 553 
general, «th- order linear differential equations, 
110,121 

general, system of linear differential equations, 
273, 296, 298, 520, 535 
geometric significance of, 1 1 
implicit, 8 

of ttth-order ordinary differential equation, 8, 497 
one- parameter family of, 10 
of partial differential equation, 715 
of system of linear ordinary differential 
equations, 265, 272, 290, 356, 508 
of system of ordinary differential equations, 495 
Spiral point, 640, 650, 660 
Spring constant, 180 

Stability of critical point, 642, 653, 661, 682 
Starting methods, 394 
Steady state term, 201 
Step function, 424 
Strip condition, 759 
Sturm comparison theorem, 581 
Sturm-Liouville problem: 
application of, to partial differential 
equations, 726, 733 
definition of, 588 

properties of solutions of, 595, 598 
Sturm separation theorem, 580 
Sturm theory, 577 
Superposition principle, 572 
Systems of linear differential equations: 
applications of, 278 
coefficient matrix of, 347, 356 
constant coefficients, 270, 301, 346, 355 
critical points of, 644 

existence theorems for solutions of, 292, 295, 
349, 352, 362, 500, 509,518 
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fundamental matrix of, 5 17 
fundamental set of solutions of, 5 1 7 
general solution of, 273, 296, 298, 520, 535 
homogeneous, normal form, 290, 292, 505, 510 
Laplace transform solution of, 453 
matrix method for solving, 346, 355 
nonhomogeneous, normal form, 290, 297, 505, 
533 

normal form, 266, 505 
operator methods for, 270, 275 
relation to single higher- order linear equation, 
267, 545 
scalar form, 507 
solution of, 272, 290, 508 
theory, normal form, 297, 499, 505 
types of, 264 

vector form, 346, 356, 507, 545 
Systems of nonlinear differential equations: 
autonomous, 633, 658 
conservative, 668 
critical points of, 634, 658, 673 
derivative of a function with respect to, 682 
examples of, 662 
for n first-order, 495 
without linear terms, 667 

Taylor series method, 384 
Total differential, 26 
Transient term, 201 
Trivial solution, 105 

Two-point boundary- value problem, 16, 588 


UC function, 141 
UC set, 142 

Undetermined coefficients, method of: 
for first-order differential equations, 386 
for rtth-order linear differential equations, 1 37 
table for, 143 

Uniform convergence, 464, 468 
Uniqueness of solutions, 19, 103, 475, 482 
Unit step function, 424 

Van der Pol equation, 704, 712 
Variables separable, equations with, 39 
Variation of parameters, 155, 537, 570 
Vector 

constant vector, 319 
definitions, basic, 313, 314 
derivative of vector function, 319 
integral of, 319 
linearly dependent, 332, 334 
linearly independent, 333, 334 
product of matrix by, 317 
vector differential equation, 346, 356, 507, 545 
vector function, 319 
Vibrating string problem, 724 
Vibrations of mass on spring, 179 

Wave equation, 720, 724 
Weierstrass M-test, 468 
Wronskian, 111, 296, 358, 513, 551 

Zeros of solutions, 260, 579, 595 
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